Home   >   CSC-OpenAccess Library   >    Manuscript Information
A Hybrid Oriya Named Entity Recognition system: Harnessing the Power of Rule
Sitanath Biswas, S. P. Mishra, S Acharya, S Mohanty
Pages - 1 - 6     |    Revised - 28-02-2010     |    Published - 01-04-2010
Volume - 1   Issue - 1    |    Publication Date - May 2010  Table of Contents
MORE INFORMATION
KEYWORDS
ABSTRACT
This paper describes a hybrid system that applies maximum entropy (MaxEnt) model with Hidden Markov model (HMM) and some linguistic rules to recognize name entities in Oriya language. The main advantage of our system is, we are using both HMM and MaxEnt model successively with some manually developed linguistic rules. First we are using MaxEnt to identify name entities in Oriya corpus, and then tagging them temporary as reference. The tagged corpus of MaxEnt now regarded as a training process for HMM. Now we use HMM for final tagging. Our approach can achieve higher precision and recall, when providing enough training data and appropriate error correction mechanism.
CITED BY (12)  
1 Patil, N., Patil, A. S., & Pawar, B. V. (2016). Survey of Named Entity Recognition Systems with respect to Indian and Foreign Languages. International Journal of Computer Applications, 134(16).
2 Das, B. R., Patnaik, S., Baboo, S., & Dash, N. S. (2015). A System for Recognition of Named Entities in Odia Text Corpus Using Machine Learning Algorithm. In Computational Intelligence in Data Mining-Volume 1 (pp. 315-324). Springer India.
3 Amarappa, S., & Sathyanarayana, S. V. kannada named entity recognition and classification (nerc) based on multinomial naïve bayes (mnb) classifier.
4 Dey, G., & Maringanti, H. B. (2014). Paninian Framework for Odia Language Processing.
5 Jimmy, L., & Kaur, D. (2013). Named entity recognition in Manipuri: a hybrid approach. In Language Processing and Knowledge in the Web (pp. 104-110). Springer Berlin Heidelberg.
6 Eboña, K. M. L., Llorca Jr, O. S., Perez, G. P., Roldan, J. M., Domingo, I. V. R., & Sagum, R. A. (2013). Named-Entity Recognizer (NER) for Filipino Novel Excerpts using Maximum Entropy Approach. Journal of Industrial and Intelligent Information Vol, 1(1).
7 Jahangir, F., Anwar, W., Bajwa, U. I., & Wang, X. (2012, December). N-gram and gazetteer list based named entity recognition for urdu: A scarce resourced language. In Proceedings of the 10th Workshop on Asian Language Resources (pp. 95-104).
8 Abdallah, S., Shaalan, K., & Shoaib, M. (2012). Integrating rule-based system with classification for Arabic named entity recognition. In Computational Linguistics and Intelligent Text Processing (pp. 311-322). Springer Berlin Heidelberg.
9 Swain, D., & Pati, C. Named Entity Disambiguation In Odia.
10 Sathyanarayana, S. A. S. A Hybrid approach for Named Entity Recognition, Classification and Extraction (NERCE) in Kannada Documents.
11 Shoaib, M. (2011). Using Machine Learning to Improve Rule based Arabic Named Entity Recognition.
12 Grando, N., Centeno, T. M., Botelho, S. S. D. C., & Fontoura, F. M. (2010). Forecasting electric energy demand using a predictor model based on liquid state machine.
1 Google Scholar 
2 Academic Index 
3 CiteSeerX 
4 refSeek 
5 Scribd 
6 PDFCAST 
7 PdfSR 
Bikel Daniel M., Miller Scott, Schwartz Richard and Weischedel Ralph. 1997. Nymble: A High Performance Learning Name-finder. In Proceedings of the Fifth Conference on Applied Natural Language Processing, 194– 201.
Borthwick Andrew. 1999. A Maximum Entropy Approach to Named Entity Recognition. Ph.D.thesis, Computer Science Department, New York University.
Cucerzan Silviu and Yarowsky David. 1999. Language Independent Named Entity Recognition Combining Morphological and Contextual Evidence. In Proceedings of the Joint SIGDAT Conference on EMNLP and VLC 1999, 90–99.
Hai Leong Chieu and Hwee Tou Ng, Named Entity Recognition with a Maximum Entropy Approach. In: Proceedings of CoNLL-2003, Edmonton, Canada, 2003, pp.160-163.
Kumarn. and Bhattacharyya Pushpak. 2006. Named Entity Recognition in Hindi using MEMM. In Technical Report, IIT Bombay,India..
Li Wei and McCallum Andrew. 2004. Rapid Development of Hindi Named Entity Recognition using Conditional Random Fields and Feature Induction (Short Paper).In ACM Transactions on Computational Logic.
McDonald R., Crammer K. and Pereira F. 2005. Flexible text segmentation with structured multilabel classification. In Proceedings of EMNLP05.
Oliver Bender, Franz Josef Och and Hermann Ney, Maximum Entropy Models for Named Entity Recognition In: Proceedings of CoNLL- 2003, Edmonton, Canada, 2003 pp.148-151.
Srihari R., Niu C. and Li W. 2000. A Hybrid Approach for Named Entity and Sub-Type Tagging. In Proceedings of the sixth conference on Applied natural language processing.
Mr. Sitanath Biswas
- India
Mr. S. P. Mishra
- India
Mr. S Acharya
- India
Mr. S Mohanty
- India


CREATE AUTHOR ACCOUNT
 
LAUNCH YOUR SPECIAL ISSUE
View all special issues >>
 
PUBLICATION VIDEOS