Home   >   CSC-OpenAccess Library   >    Manuscript Information
Full Text Available

(240.89KB)
This is an Open Access publication published under CSC-OpenAccess Policy.
Domain Specific Named Entity Recognition Using Supervised Approach
Ashwini A. Shende, Avinash J. Agrawal, Dr. O. G. Kakde
Pages - 67 - 78     |    Revised - 15-09-2012     |    Published - 24-10-2012
Volume - 3   Issue - 1    |    Publication Date - October 2012  Table of Contents
MORE INFORMATION
KEYWORDS
Named Entity , Supervised machine learning, n-gram, Context extraction, NE recognition
ABSTRACT
This paper introduces Named Entity Recognition approach for textual corpus. Supervised Statistical methods are used to develop our system. Our system can be used to categorize NEs belonging to a particular domain for which it is being trained. As Named Entities appears in text surrounded by contexts (words that are left or right of the NE), we will be focusing on extracting NE contexts from text and then perform statistical computing on them. We are using n-gram modeling for extracting contexts from text. Our methodology first extracts left and right tri-grams surrounding NE instances in the training corpus and calculate their probabilities. Then all the extracted tri-grams along with their calculated probabilities are stored in a file. During testing, system detects unrecognized NEs in the testing corpus and categorize them using the tri-gram probabilities calculated during training time. The proposed system consists of two modules namely Knowledge acquisition and NE Recognition. Knowledge acquisition module extracts the tri-grams surrounding NEs in the training corpus and NE Recognition module performs the categorization of Named Entities in the testing corpus.
CITED BY (2)  
1 Seedah, D. P. K. (2014). Retrieving information from heterogeneous freight data sources to answer natural language queries (Doctoral dissertation).
2 Agrawal, A. J., & Kakde, O. G. (2013). Semantic analysis of natural language queries using domain ontology for information access from database. International Journal of Intelligent Systems and Applications (IJISA), 5(12), 81.
1 Google Scholar
2 CiteSeerX
3 refSeek
4 Scribd
5 SlideShare
6 PdfSR
1 David Nadeau “Semi-Supervised Named Entity Recognition: Learning to Recognize 100 Entity Types with Little Supervision “
2 Mikheev, M. Moens, and C. Grover, “Named Entity Recognition without Gazetteers”, in Proceedings of Conference of European, Chapter of the Association for Computational Linguistics, EACL '99, pp. 1-8, University of Bergen, Bergen, Norway June 1999.
3 M. Collins and Y. Singer, “Unsupervised models for named entity classification”, in Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, 1999, pp. 189–196
4 G Petasis, F Vichot, F Wolinski, G Paliouras, V. Karkaletsis, and C. D. Spyropoulos, “Using machine learning to maintain rule-based named-entity recognition and classification”, in Proceedings of the 39th Annual Meeting on Association for Computational Linguistics, pp.426 – 43, Toulouse, France, 2001
5 G.S. Mann, “Fine-grained proper noun anthologies for question answering”, International Conference on Computational Linguistics, COLING-02 on SEMANET: building and using semantic networks, 2002, Vol. 11,
6 N. Fourour, and E.Morin, “Apport du Web dans la reconnaissance des entités nommées”.Revue québécoise de linguistique, 2003, vol. 32, n° 1, pp. 41-60.
7 Krstev, D. Vitas, D. Maurel, M. Tran, “Multilingual ontology of proper name”, in Proceedings of the Language and Technology Conference, pp. 116–119, Poznan, Poland, 2005
8 O. Etzioni, M. Cafarella, D. Downey, S. Kok, A. Popescu, T. Shaked, S. Soderland, D.Weld, and A.Yates, “Unsupervised named-entity extraction from the web: An experimental study”, Artificial Intelligence, 2005, vol. 65,pp. 91–134
9 N. Friburger, “Linguistique et reconnaissance automatique des noms propres”, Meta :journal des traducteurs,2006, vol. 51, n° 4, pp. 637-650
10 David Nadeau, Peter D. Turney and Stan Matwin “Unsupervised Named-Entity Recognition: Generating Gazetteers and Resolving Ambiguity”, In Proceedings of the 19th Canadian Conference on Artificial Intelligence, 2006
11 Kono Kim, Yeohoon Yoon , Harksoo Kim, and Jungyun Seo “,Named Entity Recognition Using Acyclic Weighted Digraphs: A Semi-supervised Statistical Method”, PAKDD 2007,LNAI 4426, pp. 571–578, 2007. © Springer-Verlag Berlin Heidelberg 2007.
Miss Ashwini A. Shende
RTMNU - India
zashwini@rediffmail.com
Mr. Avinash J. Agrawal
Rashtrasant Tukdoji Maharaj, Nagpur University - India
Mr. Dr. O. G. Kakde
Visvesvaraya National Institute of Technology Nagpur - India