Method for Real Time Text Extraction of Digital Manga Comic
Kohei Arai, Herman Tolle
Pages - 669 - 676     |    Revised - 31-01-2011     |    Published - 08-02-2011
Volume - 4   Issue - 6    |    Publication Date - January / February
E-comic, Manga, Image Analysis, Text Extraction, Text Recognition
Manga is popular in Japan and throughout the rest of the world. Hundreds of manga books are printed every day in Japan, and some of them are digitized into web content so that they can be read over the internet. Readers then translate the Japanese text into other languages to share the enjoyment of manga with non-Japanese readers. However, this translation of the text in printed comic books (known as scanlation) is done manually, because there is no automatic method for translating the text in a comic image into another language. The challenge in extracting Japanese characters from manga is how to detect the comic balloons and extract text written vertically, since the classic Japanese writing direction is top to bottom and right to left. Several research projects [1-4] have proposed methods for extracting text from images, but none is specific to comic images. There are two basic approaches to text extraction: region-based methods and texture-based methods. Han et al. [5] propose the concept of automatic mobile content conversion using semantic image analysis, which includes comic text extraction, but the paper does not explain the text extraction in detail. Similarly, Yamada [6] proposed a method of comic image decomposition for reading comics on mobile phones that includes comic text extraction, but again gives no details on it. These conventional methods assume an offline extraction process that works on scanned comic images. In the era of the internet and mobile devices, we need an advanced method that extracts text online and automatically.
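The vertical reading order mentioned in the abstract (columns from right to left, characters from top to bottom within a column) can be illustrated with a short sketch. This is not the authors' method. It assumes that character bounding boxes inside one balloon have already been found by some earlier step, and the function name `vertical_reading_order`, the `(x, y, width, height)` box layout, and the `max_center_shift` tolerance are hypothetical choices made only for this example.

```python
from typing import List, Tuple

# Hypothetical box layout: (x, y, width, height), origin at the top-left corner.
Box = Tuple[int, int, int, int]


def vertical_reading_order(boxes: List[Box], max_center_shift: float = 0.5) -> List[Box]:
    """Order character boxes for vertical Japanese text.

    Boxes are grouped into columns by comparing horizontal centers, columns
    are visited from right to left, and each column is read top to bottom.
    """
    columns: List[List[Box]] = []
    # Visit boxes from the rightmost horizontal center to the leftmost.
    for box in sorted(boxes, key=lambda b: b[0] + b[2] / 2, reverse=True):
        center_x = box[0] + box[2] / 2
        if columns:
            ref = columns[-1][0]  # first box placed in the current column
            ref_center_x = ref[0] + ref[2] / 2
            # Same column if the centers differ by less than a fraction of the width.
            if abs(center_x - ref_center_x) <= max_center_shift * ref[2]:
                columns[-1].append(box)
                continue
        columns.append([box])

    ordered: List[Box] = []
    for column in columns:
        # Within a column, read from top (small y) to bottom (large y).
        ordered.extend(sorted(column, key=lambda b: b[1]))
    return ordered


if __name__ == "__main__":
    shuffled = [(62, 40, 20, 20), (100, 40, 20, 20), (60, 10, 20, 20), (100, 10, 20, 20)]
    for box in vertical_reading_order(shuffled):
        print(box)
```

A real balloon would need a tolerance tuned to the font size, and small marks such as punctuation may need separate handling. The sketch only shows why a plain left-to-right, row-by-row ordering does not suit manga text.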
CITED BY (31)  
1 Wang, Y., Liu, X., & Tang, Z. (2016, January). An R-CNN Based Method to Localize Speech Balloons in Comics. In MultiMedia Modeling (pp. 444-453). Springer International Publishing.
2 Rigaud, C., Le Thanh, N., Burie, J. C., Ogier, J. M., Iwata, M., Imazu, E., & Kise, K. (2015, August). Speech balloon and speaker association for comics and manga understanding. In Document Analysis and Recognition (ICDAR), 2015 13th International Conference on (pp. 351-355). IEEE.
3 Heyvaert, P., De Nies, T., Van Herwegen, J., Vander Sande, M., Verborgh, R., De Neve, W., ... & Van de Walle, R. (2015). Using EPUB 3 and the Open Web Platform for Enhanced Presentation and Machine-Understandable Metadata for Digital Comics. New Avenues for Electronic Publishing in the Age of Infinite Collections and Citizen Science: Scale, Openness and Trust, 37.
4 Liu, X., Wang, Y., & Tang, Z. (2015, August). A clump splitting based method to localize speech balloons in comics. In Document Analysis and Recognition (ICDAR), 2015 13th International Conference on (pp. 901-905). IEEE.
5 Burie—Jean, C. R. J. C., & Ogier, M. Extraction des bulles de bandes dessinées 2.
6 Rigaud, C., Guérin, C., Karatzas, D., Burie, J. C., & Ogier, J. M. (2015). Knowledge-driven understanding of images in comic books. International Journal on Document Analysis and Recognition (IJDAR), 1-23.
7 Liu, X., Li, C., Zhu, H., Wong, T. T., & Xu, X. (2015). Text-aware balloon extraction from manga. The Visual Computer, 1-11.
8 Guérin, C., Rigaud, C., Bertet, K., Burie, J. C., Revel, A., & Ogier, J. M. (2014, June). Réduction de l'espace de recherche pour les personnages de bandes dessinées. In Reconnaissance de Formes et Intelligence Artificielle (RFIA) 2014.
9 Rigaud, C. (2014). Segmentation et indexation d’objets complexes dans les images de bandes déssinées.
10 Heyvaert, P. (2014). Enhanced Presentation and Machine-Understandable.
11 Liu, X., Shoji, K., Mori, H., & Toyama, F. (2014, February). Onomatopoeia characters extraction from comic images using constrained Delaunay triangulation. In IS&T/SPIE Electronic Imaging (pp. 90290G-90290G). International Society for Optics and Photonics.
12 Matsumiya, S., Sakti, S., Neubig, G., Toda, T., & Nakamura, S. (2014). Data-Driven Generation of Text Balloons based on Linguistic and Acoustic Features of a Comics-Anime Corpus. In Fifteenth Annual Conference of the International Speech Communication Association.
13 Rigaud, C., Karatzas, D., Burie, J. C., & Ogier, J. M. (2014). Adaptive Contour Classification of Comics Speech Balloons. In Graphics Recognition. Current Trends and Challenges (pp. 53-62). Springer Berlin Heidelberg.
14 Liu Dong, Li Lu Yuan, Wangyong Tao, soup & flag. (2014). Chinese comic dialogue automatic positioning method for unsupervised. Peking University (Natural Science), 1, 004.
15 Rigaud, C. (2014). Segmentation et indexation d’objets complexes dans les images de bandes déssinées.
16 Guerin, C. (2014). Proposed Framework for automatic analysis, interpretation and interactive search of cartoon pictures (Doctoral dissertation, University of La Rochelle).
17 Rigaud, C. (2014). Segmentation and indexation of complex objects in comic book images (Doctoral dissertation, Université de La Rochelle).
18 Arai, K. Wearable Computing System with Input-Output Devices Based on Eye-Based Human Computer Interaction Allowing Location Based Web Services.
19 Pinto, M., Puech, W., & Subsol, G. (2013, September). Protection of JPEG compressed e-comics by selective encryption. In Image Processing (ICIP), 2013 20th IEEE International Conference on (pp. 4588-4592). IEEE.
20 Tolle, H., & Arai, K. (2013, September). Manga content extraction method for automatic mobile comic content creation. In Advanced Computer Science and Information Systems (ICACSIS), 2013 International Conference on (pp. 321-328). IEEE.
21 Rigaud, C., Karatzas, D., Van De Weijer, J., Burie, J. C., & Ogier, J. M. (2013, February). Automatic text localisation in scanned comic books. In 9th International Conference on Computer Vision Theory and Applications.
22 Rigaud, C., Burie, J. C., Ogier, J. M., Karatzas, D., & Van de Weijer, J. (2013, August). An active contour model for speech balloon detection in comics. In Document Analysis and Recognition (ICDAR), 2013 12th International Conference on (pp. 1240-1244). IEEE.
23 Rigaud, C., Tsopze, N., Burie, J. C., & Ogier, J. M. (2013). Robust frame and text extraction from comic books. In Graphics Recognition. New Trends and Challenges (pp. 129-138). Springer Berlin Heidelberg.
24 Correia, J. M. C. Balloon Extraction from Complex Comic Books Using Edge Detection and Histogram Scoring (Doctoral dissertation, Universidade da Beira Interior).
25 Kohei Arai. (2012). Human-computer interaction and its application system based on the line-of-sight. Image e-Journal, 41 (3), 296-301.
26 Arai, K. (2012). Method for leaning efficiency improvements based on gaze location notifications on e-learning content screen display. International Journal of Advanced Research in Artificial Intelligence, 1(3), 1-6.
27 Rigaud, C., Tsopze, N., Burie, J. C., & Ogier, J. M. (2012, March). Extraction robuste des cases et du texte de bandes dessinées. In CIFED (pp. 349-360).
28 Ponsard, C., Ramdoyal, R., & Dziamski, D. (2012). An ocr-enabled digital comic books viewer (pp. 471-478). Springer Berlin Heidelberg.
29 Arai, K., & Tolle, H. (2012). E-comic content adaptation method for mobile phone devices.
30 Arai, K., & Tolle, H. (2011). Method for extracting product information from TV commercial. International Journal of Advanced Computer Science and Applications, Special Issue on Artificial Intelligence, 2(8), 125-131.
31 Guérin, C., Mercier, A., Rigaud, C., Tsopze, N., Bertet, K., Burie, J. C., ... & Revel, A. Ontologie d'images de bandes dessinées: utilisation de Sewelis.
A.K. Jain and B. Yu, Automatic text location in images and video frames. Pattern Recognition 31(12) (1998), pp. 2055–2076.
Eunjung Han et al., “Automatic Mobile Content Conversion Using Semantic Image Analysis”, Human-Computer Interaction. HCI Intelligent Multimodal Interaction Environments, LNCS 4552, Springer, Berlin, 2007.
F. Chang, C-J. Chen and C-J. Lu. “A Linear-Time Component-Labeling Algorithm Using Contour Tracing Technique”, Computer Vision and Image Understanding, 93(2):pp. 206-220, 2004.
Kohei Arai, Herman Tolle, “Method for Automatic E-Comic Scene Frame Extraction for Reading Comic on Mobile Devices”, Seventh International Conference on Information Technology: New Generations (ITNG), pp. 370-375, 2010.
Arai, K., Tolle, H., “Automatic E-Comic Content Adaptation”, International Journal of Ubiquitous Computing (IJUC), Volume (1): Issue (1), May 2010.
L.A. Fletcher and R. Kasturi, A robust algorithm for text string separation from mixed text/graphics images. IEEE Trans. Pattern Anal. Mach. Intell. 10(6) (1988), pp. 910–918.
O. Iwaki, K. Kubota and H. Arakawa, A character/graphic segmentation method using neighborhood line density. IEICE Trans. Inform. Process. J68(4) (1985), pp. 821–828.
R. Gonzalez and R. Woods, “Digital Image Processing”, Chap. 2, Addison-Wesley Publishing Company (1992).
Yamada, M., Budiarto, R. and Endoo, M., “Comic image decomposition for reading comics on cellular phones”, IEICE Transactions on Information and Systems, E-87-D (6): 1370-1376, June 2004.
Mr. Kohei Arai
- Japan
Mr. Herman Tolle
Saga University - Japan