XMODEL: An XML-based Morphological Analyzer for Arabic Language
Mourad Gridach, Noureddine Chenfour
Pages - 12 - 26     |    Revised - 30-08-2010     |    Published - 30-10-2010
Volume - 1   Issue - 2    |    Publication Date - October 2010
NLP, Morphology, Arabic Morphological Analyzer, Morphological Automaton, XMODEL language
Morphological analysis is an essential stage in language engineering applications. For Arabic, this stage is difficult to develop because the language exhibits agglutination and a high degree of morphological ambiguity. These properties complicate the design of an Arabic morphological analyzer and call for many additional tools and processing steps. The size of the lexicon is another major problem, since it directly affects the analysis process. In this paper we present a morphological analyzer for Modern Standard Arabic that is based on the Arabic Morphological Automaton technique and uses a new language, XMODEL, to represent Arabic morphological knowledge in an optimal way. Both the Arabic Morphological Analyzer and the Arabic Morphological Automaton are implemented in Java and use XML technology. The Buckwalter Arabic Morphological Analyzer and Xerox Arabic Finite State Morphology are two of the best-known morphological analyzers for Modern Standard Arabic, and both are available and documented. Our morphological analyzer can be used by Natural Language Processing (NLP) applications such as machine translation, spelling correction, information retrieval, and both syntactic and semantic analyzers. Finally, we present an evaluation of the Xerox system and our own.
CITED BY (4)  
1 Tamburini, F. (2013). The AnIta-Lemmatiser: A Tool for Accurate Lemmatisation of Italian Texts. In Evaluation of Natural Language and Speech Tools for Italian (pp. 266-273). Springer Berlin Heidelberg.
2 Tamburini, F., & Melandri, M. (2012). AnIta: a powerful morphological analyser for Italian. In LREC (pp. 941-947).
3 Tamburini, F. (2011). The anita-lemmatiser. Working Notes of EVALITA.
4 Harrathi, R., Ouni, C., & Farhat, M. Impact de l’intégration de l’analyse morphologique de la langue arabe dans un système de recherche d’information open source.
Abouenour L., EL Hassani S., Yazidy T., Bouzouba K., Hamdani A. “Building an Arabic Morphological Analyzer as part of an Open Arabic NLP Platform”. In the Language Resources and Evaluation Conference (LREC), Marrakech, Morocco, 31st May, 2008
Attia M. (2005). “Developing a Robust Arabic Morphological Transducer Using Finite State Technology”. 8th Annual CLUK Research Colloquium. Manchester, UK
Attia, M. (2000). “A large-scale computational processor of the Arabic Morphology and applications”. Thesis submitted to the faculty of engineering, Cairo University
Attia, M. (2006). “An Ambiguity-Controlled Morphological Analyzer for Modern Standard Arabic Modelling Finite State Networks”. The Challenge of Arabic for NLP/MT Conference, the British Computer Society, London
Atwell E., Al-Sulaiti L., Al-Osaimi S., Abu Shawar B. (2004). “Un Examen d'Outils pour l'Analyse de Corpus Arabes”. JEP-TALN 04, Arabic Language Processing, Fès, 19-22 April 2004
Beesley KR (1996). “Arabic Finite-State Morphological Analysis and Generation”. Proceedings of the 16th conference on Computational linguistics, Vol 1. Copenhagen, Denmark: Association for Computational Linguistics, pp 89-94
Beesley KR (2000). “Finite-State Non-Concatenative Morphotactics”. SIGPHON-2000, Proceedings of the Fifth Workshop of the ACL Special Interest Group in Computational Phonology, p. 1-12, August 6, 2000, Luxembourg
Beesley KR. 1998b. “Arabic morphology using only finite-state operations”. In Michael Rosner, editor, Computational Approaches to Semitic Languages: Proceedings of the Workshop, pages 50–57, Montreal, Quebec, August 16. Université de Montreal
Buckwalter T. (2002). “Buckwalter Arabic Morphological Analyzer Version 1.0. Linguistic Data Consortium”. University of Pennsylvania, LDC Catalog No.: LDC2002L49
Darwish K (2002). “Building a Shallow Morphological Analyzer in One Day”. Proceedings of the workshop on Computational Approaches to Semitic Languages in the 40th Annual Meeting of the Association for Computational Linguistics (ACL-02). Philadelphia, PA, USA
El-Sadany, T. A., Hashish, M. A. (1989). “An Arabic Morphological System”. IBM Systems Journal, vol. 28, no. 4
Mars M., Belgacem M. (2006). “Developed of a morphological analyser for Arabic language, tool for creation of educational activities of training of Arabic”. Workshop "TEL in working context", 13-15 November 2006, Grenoble, France. 2006
Mars M., Belgacem M., Zrigui M., Antoniadis G., (2007). “Analyseur morphologique de l'arabe”. CITALA2007, 18-19 juin 2007, Rabat, Maroc
Otakar Smrz. ElixirFM. “Implementation of Functional Arabic Morphology”. In ACL 2007 Proceedings of the Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources, pages 1–8, Prague, Czech Republic, 2007.
Shaalan K. “Extending Prolog for Better Natural Language Analysis”. In Proceeding of the 1st Conference on Language Engineering, Egyptian Society of Language Engineering (ELSE), pp. 225-236, Egypt, March 14-15, 1998.
Tahir Y., Chenfour N., Harti M., “Modélisation à objets d'une base de données morphologique pour la langue arabe”. JEP-TALN 2004, Traitement Automatique de l’Arabe, Fès, 20 avril 2004.
Mr. Mourad Gridach
Faculty of Science - Morocco
Professor Noureddine Chenfour
- Morocco