Home   >   CSC-OpenAccess Library   >    Manuscript Information
Full Text Available

(230.57KB)
This is an Open Access publication published under CSC-OpenAccess Policy.
Design of A Spell Corrector For Hausa Language
Lawaly Salifou, Harouna  Naroua
Pages - 14 - 26     |    Revised - 10-05-2014     |    Published - 01-06-2014
Volume - 5   Issue - 2    |    Publication Date - May / June 2014  Table of Contents
MORE INFORMATION
KEYWORDS
Natural Language Processing, Spell Checker, Spell Corrector, Computerization of Hausa, African Languages.
ABSTRACT
In this article, a spell corrector has been designed for the Hausa language which is the second most spoken language in Africa and do not yet have processing tools. This study is a contribution to the automatic processing of the Hausa language. We used existing techniques for other languages and adapted them to the special case of the Hausa language. The corrector designed operates essentially on Mijinguini’s dictionary and characteristics of the Hausa alphabet. After a brief review on spell checking and spell correcting techniques and the state of art in the Hausa language processing, we opted for the data structures trie and hash table to represent the dictionary. The edit distance and the specificities of the Hausa alphabet have been used to detect and correct spelling errors. The implementation of the spell corrector has been made on a special editor developed for that purpose (LyTexEditor) but also as an extension (add-on) for OpenOffice.org. A comparison was made on the performance of the two data structures used.
CITED BY (3)  
1 Al-Bakry, A. M., & Al-Rikaby, M. K. Enhanced Levenshtein Edit Distance Method functioning as a String-to-String Similarity Measure. Distributed Agents for Web Content Filtering, 48.
2 Negesse, F. (2015). Classification of Oromo Dialects: A Computational Approach.
3 Nijim, A., El Shenawy, A., Mostafa, M. T., & Alez, R. A. Anovel Approach for recognizing text in arabic ancient manuscripts.
1 Google Scholar
2 CiteSeerX
3 refSeek
4 Scribd
5 SlideShare
6 PdfSR
1 K. Kukich. "Techniques for automatically correcting words in text", ACM Computing Surveys, vol. 24, No. 4, 1992.
2 M.N. Pierre. "An introduction to language processing with Perl and Prolog", Springer-Verlag Berlin Heidelberg, p.2-3, 2006.
3 J.L. Peterson . "Computer Programs for Detecting and Correcting Spelling Errors", Comm.ACM, vol. 23, No. 12, December 1980.
4 V. Suzan. "Context-sensitive spell checking based on word trigram probabilities", Master thesis, February - August 2002.
5 N.A. Cyril. "String similarity and misspellings", Communications of the A.C.M., vol. 10, no.5, pp. 302-313, May 1967.
6 F.J. Damerau. "A technique for computer detection and correction of spelling errors",Comm. ACM 7, vol. 3 , pp. 171-176, March 1964.
7 L.L. Hsuan. "Spell Checkers and Correctors: a unified treatment", Master dissertation,November 2008
8 B. Laurent. "Production de logiciels et d'utilitaires pour le traitement informatique de langues africaines dans un contexte de NTIC multilingues", 2nd World Congress of Community Networks, Buenos Aires, Argentine, du 5 au 7 décembre 2001.
9 P.N. Mark. "A Comparison of Dictionary Implementations", April 10, 2009.
10 E.M. Zamora. Pollock J. J. and Antonio Z., "The use of trigram analysis for spelling error detection", Information Processing & Management, vol. 17. No. 6, pp. 305-316, 1981.
11 K. Donald. "The Art of Computer Programming", Addison-Wesley Publishing Co.,Philippines, vol. 3, 1973.
12 A.V. Aho and M.J. Corasick. "Efficient String Matching: An Aid to Bibliographic Search",Communications of the ACM, vol. 18, No. 6, pp. 333-340, June 1975.
13 G.S. Lehal and K. Singh. "A Comparative Study of Data Structures for Punjabi Dictionary",5th International Conference on Cognitive Systems, reviews & previews, ICCS’99, pp. 489-497, 2000.
14 J. Daniel and H.M. James, "Speech and Language Processing", Prentice Hall, Englewood Cliffs, Inc., 2000.
15 B. Horst. "A Fast Algorithm for Finding the Nearest Neighbor of a Word in a Dictionary",IAM-93-025, November 1993.
16 The Online Encyclopedia of Writing Systems & Language, 16/12/2013 09:31,http://www.omniglot.com/writing/definition.htm.
17 http://www.humnet.ucla.edu/humnet/aflang/Hausa/hausa.html.
18 A. Mijiguin and H. Naroua. "Règles de formation des noms en haoussa", Actes de la conférence conjointe JEP-TALN-RECITAL 2012, Atelier TALAf 2012: Traitement Automatique des Langues Africaines, pp. 63-74, 2012.
19 M.G. Maman and H.H. Seydou. "Les Langues de scolarisation dans l’enseignement fondamental en Afrique subsaharienne francophone : cas du Niger", Rapport d’étude pays,2010.
20 A.V. Van Der and D.S. Gilles-Maurice. " The African Languages on the Internet: Case Studies for Hausa, Somali, Lingala and isiXhosa", Cahiers Du Rifal, vol. 23, pp. 33–45,2003.
21 C. Bernard. "Les langues au Nigeria", Notre Librairie, Revue des littératures du Sud, Littératures du Nigéria et du Ghana, vol. 2, no. 141, pp. 8-15, 2000
22 Arrêté N°0212 MEN/SP-CNRE du 19 oct. 1999 modifiant et complétant l'arrêté n°01/MEN/SCNRE/MJSC/MESR/M.INF/MDR/MI du 15 mars 1981 relatif à l'orthographe de la langue hausa.
23 N. Ahmed. "Adaptation des écritures et de la lecture des langues étrangères au pays Haoussa de l’Afrique de l’Ouest", Synergies Algérie n°6 – 2009, pp. 61-69, 2009.
24 C. Chanard and A. Popescu-Belis. "Encodage informatique multilingue : application au contexte du Niger", Les Cahiers du Rifal, No. 22, pp. 33-45, 2001.[33]Christophe D.,"Apprendre à programmer, algorithmes et conception objet", 2e ed., Eyrolles, 2008.
25 O. Don. "Les langues africaines a l’ère du numerique, défis et opportunités de l’informatisation des langues autochtones", Les Presses de l’Université Laval, CRDI 2011.
26 G.P. Bargery. "A Hausa-English Dictionary and English-Hausa Vocabulary", Oxford University Press, London, 1934.
27 M.N. Roxana and N. Paul. "The Hausa Lexicographic Tradition", Lexikos11, AFRILEXreeks,series 11, pp. 263-286, 2001
28 A. Minjinguini. "Dictionnaire élémentaire hausa-français", les éditions GG, 2003.
29 N. Paul. "The Hausa Language An Encyclopedic Reference Grammar", Yale University Press, New Haven, 2000.
30 D.S. Gilles-Maurice. "Web for/as Corpus: A Perspective for the African Languages", Nordic Journal of African Studies, vol. 11, No. 2, pp. 266-282, 2002.
31 C. Enguehard and H. Naroua. "Evaluation of Virtual Keyboards for West-African Languages", Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08), Marrakech, Morocco, 28-30 May 2008.
32 C. Enguehard and C. Mbodj. "Des correcteurs orthographiques pour les langues africaines", Bulletin de Linguistique Appliquée et Générale, 2004.
33 D. Christophe. "Apprendre à programmer, algorithmes et conception objet", 2e ed.,Eyrolles, 2008.
34 M. Brett, P. Gary and W. David. "Head First Object-Oriented Analysis and Design",O'Reilly, Nov. 2006.
35 S. Robert and W. Kevin. "Algorithms", 4e ed., Addison Wisley, 2011
36 OpenOffice.org 3.1 Developer's Guide, https://wiki.openoffice.org.
Mr. Lawaly Salifou
Université Abdou Moumouni de Niamey - Niger
Associate Professor Harouna  Naroua
Université Abdou Moumouni de Niamey - Niger
hnaroua@yahoo.com