This is an Open Access publication published under CSC-OpenAccess Policy.
XMODEL: An XML-based Morphological Analyzer for Arabic Language
Mourad Gridach, Noureddine Chenfour
Pages - 12 - 26 | Revised - 30-08-2010 | Published - 30-10-2010
Volume - 1 | Issue - 2 | Publication Date - October 2010
KEYWORDS
NLP, Morphology, Arabic Morphological Analyzer, Morphological Automaton, XMODEL language
ABSTRACT
Morphological analysis is an essential stage in language engineering applications. For Arabic, this stage is hard to develop because the language exhibits agglutination and a high degree of morphological ambiguity. These particularities complicate the design of a morphological analyzer for Arabic and call for many additional tools and processing steps. The size of the lexicon is another major problem, because it directly affects the analysis process. In this paper we present a morphological analyzer for Modern Standard Arabic that is based on the Arabic Morphological Automaton technique and uses a new language, XMODEL, to represent Arabic morphological knowledge in an optimal way. Both the Arabic Morphological Analyzer and the Arabic Morphological Automaton are implemented in Java and use XML technology. The Buckwalter Arabic Morphological Analyzer and Xerox Arabic Finite State Morphology are two of the best-known morphological analyzers for Modern Standard Arabic, and both are available and documented. Our morphological analyzer can be used by Natural Language Processing (NLP) applications such as machine translation, spelling correction, information retrieval, and syntactic and semantic analyzers. Finally, we present an evaluation of the Xerox system and our own.
CITED BY (4)  
1 Tamburini, F. (2013). The AnIta-Lemmatiser: A Tool for Accurate Lemmatisation of Italian Texts. In Evaluation of Natural Language and Speech Tools for Italian (pp. 266-273). Springer Berlin Heidelberg.
2 Tamburini, F., & Melandri, M. (2012). AnIta: a powerful morphological analyser for Italian. In LREC (pp. 941-947).
3 Tamburini, F. (2011). The anita-lemmatiser. Working Notes of EVALITA.
4 Harrathi, R., Ouni, C., & Farhat, M. Impact de l’intégration de l’analyse morphologique de la langue arabe dans un système de recherche d’information open source.
REFERENCES
1 Abouenour L., EL Hassani S., Yazidy T., Bouzouba K., Hamdani A. “Building an Arabic Morphological Analyzer as part of an Open Arabic NLP Platform”. In the Language Resources and Evaluation Conference (LREC), Marrakech, Morocco, 31st May, 2008
2 Attia, M. (2000). “A large-scale computational processor of the Arabic Morphology and applications”. Thesis submitted to the faculty of engineering, Cairo University
3 Attia M. (2005). “Developing a Robust Arabic Morphological Transducer Using Finite State Technology”. 8th Annual CLUK Research Colloquium. Manchester, UK
4 Attia, M. (2006). “An Ambiguity-Controlled Morphological Analyzer for Modern Standard Arabic Modelling Finite State Networks”. The Challenge of Arabic for NLP/MT Conference, the British Computer Society, London
5 Atwell E., Al-Sulaiti L., Al-Osaimi S., Abu Shawar B. (2004). “Un Examen d'Outils pour l'Analyse de Corpus Arabes”. JEP-TALN 04, Arabic Language Processing, Fès, 19-22 April 2004
6 Beesley KR (1996). “Arabic Finite-State Morphological Analysis and Generation”. Proceedings of the 16th conference on Computational linguistics, Vol 1. Copenhagen, Denmark: Association for Computational Linguistics, pp 89-94
7 Beesley KR. 1998b. “Arabic morphology using only finite-state operations”. In Michael Rosner, editor, Computational Approaches to Semitic Languages: Proceedings of the Workshop, pages 50–57, Montreal, Quebec, August 16. Université de Montreal
8 Beesley KR (2000). “Finite-State Non-Concatenative Morphotactics”. SIGPHON-2000, Proceedings of the Fifth Workshop of the ACL Special Interest Group in Computational Phonology, p. 1-12, August 6, 2000, Luxembourg
9 Buckwalter T. (2002). “Buckwalter Arabic Morphological Analyzer Version 1.0”. Linguistic Data Consortium, University of Pennsylvania, LDC Catalog No.: LDC2002L49
10 Darwish K (2002). “Building a Shallow Morphological Analyzer in One Day”. Proceedings of the workshop on Computational Approaches to Semitic Languages in the 40th Annual Meeting of the Association for Computational Linguistics (ACL-02). Philadelphia, PA, USA
11 El-Sadany, T. A., Hashish, M. A. (1989). “An Arabic Morphological System”. IBM SYSTEM JOURNAL vol 28-no 4
12 Mars M., Belgacem M. (2006). “Developed of a morphological analyser for Arabic language, tool for creation of educational activities of training of Arabic”. Workshop "TEL in working context", 13-15 November 2006, Grenoble, France
13 Mars M., Belgacem M., Zrigui M., Antoniadis G. (2007). “Analyseur morphologique de l'arabe”. CITALA2007, 18-19 juin 2007, Rabat, Maroc
14 Otakar Smrz. ElixirFM. “Implementation of Functional Arabic Morphology”. In ACL 2007 Proceedings of the Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources, pages 1–8, Prague, Czech Republic, 2007.
15 Shaalan K. “Extending Prolog for Better Natural Language Analysis”. In Proceeding of the 1st Conference on Language Engineering, Egyptian Society of Language Engineering (ELSE), pp. 225-236, Egypt, March 14-15, 1998.
16 Tahir Y., Chenfour N., Harti M., “Modélisation à objets d'une base de données morphologique pour la langue arabe”. JEP-TALN 2004, Traitement Automatique de l’Arabe, Fès, 20 avril 2004.
Mr. Mourad Gridach
Faculty of Science - Morocco
mourad_i4@yahoo.fr
Professor Noureddine Chenfour
- Morocco