Home   >   CSC-OpenAccess Library   >    Manuscript Information
Full Text Available

(299.95KB)
This is an Open Access publication published under CSC-OpenAccess Policy.
Diacritic Oriented Arabic Information Retrieval System
Khalid Saleh Aloufi
Pages - 143 - 155     |    Revised - 31-03-2011     |    Published - 04-04-2011
Volume - 5   Issue - 1    |    Publication Date - March / April 2011  Table of Contents
MORE INFORMATION
KEYWORDS
Search Engine, Information Retrieval Performance Analysis, Arabic Language, Diacritics, Information Retrieval Systems
ABSTRACT
Arabic language support in search engines and operating systems is improved in recent years. Searching in the Internet is reliable and can be compared to the excellent support for several other languages, including English. However, for text with diacritics there are some limitations. For this reason, most Information retrieval (IR) systems remove diacritics from text and ignore it for its complexity. Searching text with diacritics is important for some kinds of documents, such as those of religious books, some newspapers and children stories. This research shows the design and development of the system that overcome the problem. The proposed system considers diacritics. The proposed system includes the design complexity in the retrieving algorithm rather than the information repository, which is database in this study. Also, this study analyses the results and the performance. Results are promising and performance analysis shows methods to enhance design and increase the performance. The proposed system can be integrated in search engines, text editors and any information retrieval system that include Arabic text. Performance analysis of the proposed system shows that this system is reliable. The proposed system is applied on database of Hadeeth, which is religious book includes the prophet action and statements. The system can be applied in any kind of data repository.
CITED BY (3)  
1 AL-Nasrawi, D. A., Almukhtar, A. F., & AL-Baldawi, W. S. (2015). From Arabic Alphabets to Two Dimension Shapes in Kufic Calligraphy Style Using Grid Board Catalog. Communications in Applied Sciences, 3(2).
2 Gutema, G. (2012). Afaan Oromo Text Retrieval System (Doctoral dissertation, AAU).
3 Gezehagn, G. E. (2012). Afaan Oromo Text Retrieval System.
1 Google Scholar
2 Academic Journals Database
3 CiteSeerX
4 iSEEK
5 Bielefeld Academic Search Engine (BASE)
6 Scribd
7 SlideShare
8 PdfSR
1 Hammo , Bassam, Mahmoud EL-Haj, Azzam Sleit (2008), Enhancing Retrieval Effectiveness of Diacritisized Arabic Passages Using Stemmer and Thesaurus: The 19th Midwest Artificial Intelligence and Cognitive Science Conference (Cincinnati, OH, USA).
2 Lazarinis , Fotis, Jesus Vilares Ferro, John Tait (2007), Improving Non-English Web Searching (iNEWS07), SIGIR Forum 41(2) 72-76
3 Oracle (2010),http://www.java.com(accessed 17 April 2010).
4 Sudeshna Sarkar (2007), Regular Expression Matching for Multi-script Databases, Bulletin on the Technical Committee on Data Engineering 30(1) 17-29.
5 Sun Microsystems (2010), MySQL 5.5 Reference Manual (2010). Available at: www.mysql.com (accessed 16 Mar 2010).
6 The Unicode Consortium (2006), Unicode Standard, Version 5.0, Addison-Wesley, 5th edition.
7 Thabet , N. (2004), Stemming the Qur'an, Proceedings of the Workshop on Computational Approaches to Arabic Script-based Languages .
8 Zaki Aldeen Almonthery (2003), Mukhtasar Saheeh Muslim.
9 Alhajjar , A., Mohammad Hajjar, Khaldoun Zreik (2009), Classification of Arabic Information Extraction methods, 2nd International Conference on Arabic Language Resources and Tools, Cairo, Egypt.
10 Beitzel, S., U. Syed, E. Jensen, O. Frieder and D. Grossman (2006), "On the Development of Name Search Techniques for Arabic", Journal of the American Society for Information Science and Technology, 57(6), pp.728 739.
11 Carol Lundquist, Ophir Frieder, David O. Holmes and David Grossman (1999), parallel relational database management system approach to relevance feedback in information retrieval, Journal of the American Society for Information Science (JASIS), 50(5);413-426.
Dr. Khalid Saleh Aloufi
Taibah University - Saudi Arabia
kalawfi@yahoo.com