This is an Open Access publication published under CSC-OpenAccess Policy.
The Arabic Speech Database: PADAS
Mohamed Khalil Krichi, Adnan Cherif
Pages: 10-19 | Revised: 10-05-2014 | Published: 15-06-2014
Volume: 8 | Issue: 2 | Publication Date: June 2014
Keywords: HTK, MAUS, Phonetic Database, Automatic Segmentation.
This work describes the construction of PADAS (“Phonetics Arabic Database Automatically Segmented”), which is based on a data-driven Markov process. A segmented database is necessary for both speech synthesis and speech recognition. Manual segmentation is precise but inconsistent, since it is often produced by more than one labeler, and it costs time and money. MAUS segmentation and labeling exists for German and other languages but not for Arabic, so MAUS must be adapted to build a segmented database for Arabic. The speech corpus contains a total of 600 sentences recorded by 3 native Arabic speakers from Tunisia (2 male and 1 female), 200 sentences each.
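To make the idea of Markov-model-based automatic segmentation concrete, the following is a minimal sketch of Viterbi forced alignment over a known, linear phone sequence. It is an illustration under stated assumptions, not the PADAS, MAUS, or HTK implementation: the function name `forced_align` and the input `loglik` (per-frame log-likelihoods for each phone of the transcript) are hypothetical, and the acoustic scores in the usage example are made up. MAUS additionally searches over pronunciation variants, which this sketch does not attempt.

```python
def forced_align(loglik):
    """Align frames to a known phone sequence with a left-to-right Viterbi pass.

    loglik[t][p] is the (assumed, precomputed) log-likelihood of frame t
    under the p-th phone of the transcript. Every phone must receive at
    least one frame. Returns one (start_frame, end_frame_exclusive) pair
    per phone.
    """
    n_frames = len(loglik)
    n_phones = len(loglik[0])
    if n_frames < n_phones:
        raise ValueError("need at least one frame per phone")

    neg_inf = float("-inf")
    score = [[neg_inf] * n_phones for _ in range(n_frames)]
    back = [[0] * n_phones for _ in range(n_frames)]
    score[0][0] = loglik[0][0]  # the alignment must start in the first phone

    for t in range(1, n_frames):
        for p in range(n_phones):
            stay = score[t - 1][p]                           # remain in phone p
            move = score[t - 1][p - 1] if p > 0 else neg_inf  # advance from p-1
            if move > stay:
                best, back[t][p] = move, p - 1
            else:
                best, back[t][p] = stay, p
            if best > neg_inf:
                score[t][p] = best + loglik[t][p]

    # Backtrace from the last frame, which must belong to the last phone.
    path = [n_phones - 1]
    for t in range(n_frames - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()

    # Collapse the frame-level path into per-phone segments.
    segments = []
    start = 0
    for t in range(1, n_frames + 1):
        if t == n_frames or path[t] != path[t - 1]:
            segments.append((start, t))
            start = t
    return segments


# Made-up scores: 6 frames, 3 phones; each frame clearly favors one phone.
example_loglik = [
    [-0.1, -5.0, -5.0],
    [-0.1, -5.0, -5.0],
    [-5.0, -0.1, -5.0],
    [-5.0, -0.1, -5.0],
    [-5.0, -0.1, -5.0],
    [-5.0, -5.0, -0.1],
]
example_segments = forced_align(example_loglik)
print(example_segments)
```

Multiplying the frame indices by the frame shift of the acoustic front end (an assumption that depends on the feature extraction used) would turn these segments into time boundaries.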
Mr. Mohamed Khalil Krichi
Faculty of Sciences of Tunis, Laboratory of Signal Processing, Physics Department, University of Tunis-Manar, Tunis 1060, Tunisia
Professor Adnan Cherif
Faculty of Sciences of Tunis, Laboratory of Signal Processing, Physics Department, University of Tunis-Manar, Tunis 1060, Tunisia