|
| A Novel Approach for Bilingual (English - Oriya) Script Identification and Recognition in a Printed Document
|
|
|
|
Source |
International Journal of Image Processing (IJIP) |
|
|
Volume: 4 Issue: 2 |
| |
Pages: 89-191 |
|
Publication Date: May 2010 |
|
ISSN (Online): 1985-2304 |
|
|
|
|
|
Pages |
175 - 191 |
|
Author(s) |
Sanghamitra Mohanty, Himadri Nandini Das Bebartta |
|
|
Published Date |
10-06-2010 |
|
Publisher |
CSC Journals, Kuala Lumpur, Malaysia |
|
| |
|
| |
KEYWORDS: Script separation, Indian script, Bilingual (English-Oriya) OCR, Horizontal profiles |
|
|
| |
|
|
| This manuscript is indexed in the following databases/websites: |
|
| 1. Directory of Open Access Journals (DOAJ) |
| 2. PDFCAST |
| 3. Scribd |
| 4. Docstoc |
| 5. Google Scholar |
| 6. WorldCat |
| 7. Bielefeld Academic Search Engine (BASE) |
| 8. refSeek |
| 9. iSEEK |
| 10. Socol@r |
| |
|
| |
|
|
| Official papers and school textbooks in India often contain English words interspersed within text in an Indian language. There is therefore a need for an Optical Character Recognition (OCR) system that can recognize such bilingual documents and store them for future use. In this paper we present an OCR system developed to recognize printed documents that contain an Indian script, Oriya, together with Roman script. For this purpose, the different scripts must be separated before they are fed to their individual OCR systems. The skew is corrected first, and the document is then segmented. We propose to differentiate the scripts line by line. We rely on the upper and lower matras, which are associated with Oriya and absent in English. A horizontal histogram is used to distinguish lines that belong to different scripts. After separation, the scripts are sent to their individual recognition engines. |
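
The line-wise use of a horizontal histogram described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the binary-image representation (a list of rows with 1 for ink), and the `zone_fraction` parameter are all assumptions introduced here for illustration. The sketch computes the row-wise ink profile, splits the page into text lines at empty rows, and measures how much of a line's ink falls in its upper and lower zones, where matras would be expected to contribute.

```python
from typing import List, Tuple

# Assumed representation: binary image as rows of 0/1, where 1 is an ink pixel.
Image = List[List[int]]


def horizontal_profile(img: Image) -> List[int]:
    """Count ink pixels in each row (the horizontal histogram)."""
    return [sum(row) for row in img]


def segment_lines(profile: List[int]) -> List[Tuple[int, int]]:
    """Return (top, bottom) row ranges, bottom exclusive, for runs of inked rows."""
    lines = []
    start = None
    for y, count in enumerate(profile):
        if count > 0 and start is None:
            start = y                      # a text line begins
        elif count == 0 and start is not None:
            lines.append((start, y))       # a blank row ends the line
            start = None
    if start is not None:
        lines.append((start, len(profile)))
    return lines


def outer_zone_ratio(profile: List[int], top: int, bottom: int,
                     zone_fraction: float = 0.25) -> float:
    """Share of a line's ink lying in its upper and lower zones.

    zone_fraction is a hypothetical parameter: the fraction of the line
    height treated as the upper zone and, separately, as the lower zone.
    """
    height = bottom - top
    total = sum(profile[top:bottom])
    if height < 3 or total == 0:
        return 0.0                         # too thin to split into zones
    band = max(1, int(height * zone_fraction))
    upper = sum(profile[top:top + band])
    lower = sum(profile[bottom - band:bottom])
    return (upper + lower) / total
```

A script decision could then compare `outer_zone_ratio` for each line against a threshold tuned on labelled sample lines. Any such threshold is an assumption of this sketch; the abstract does not state the decision rule.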
| |
|
| |
|
| |
|
|
|
| Sanghamitra Mohanty
| Himadri Nandini Das Bebartta
|
|