|
| Segmentation of Malay Syllables in Connected Digit Speech Using Statistical Approach
|
|
Full
text: |
PDF(147.5KB) |
|
|
Source |
International Journal of Computer Science and Security (IJCSS) |
|
Table of Contents |
|
|
Download
Complete Issue PDF(1.35MB) |
|
Volume: 2 Issue: 1 |
| |
Pages: 1-86 |
|
Publication
Date: February 2008 |
|
ISSN
(Online): 1985-1553 |
|
|
|
|
|
Pages |
23 - 33 |
|
Author(s) |
|
|
|
Published
Date |
30-02-2008 |
|
Publisher |
CSC
Journals, Kuala Lumpur,
Malaysia |
|
ADDITIONAL
INFORMATION |
| Keywords Abstract References Cited by Related Articles Collaborative
Colleague |
| |
|
| |
KEYWORDS: Speech Segmentation, Divergence Algorithm, Brandt’s Algorithm |
|
|
| |
|
|
| This Manuscript is indexed in the following databases/websites:- |
|
| 1. Directory of Open Access Journals (DOAJ) |
| 2. Docstoc |
| 3. Scribd |
| 4. PDFCAST |
| 5. ScientificCommons |
| 6. WorldCat |
| 7. Google Scholar |
| 8. Academic Journals Database |
| 9. Bielefeld Academic Search Engine (BASE) |
| 10. refSeek |
| 11. Academic Index |
| 12. iSEEK |
| 13. Socol@r |
| 14. Microsoft Academic Search |
| 15. Libsearch |
| 16. slideshare |
| 17. Chinese Directory Of Open Access |
| |
|
| |
|
|
| This study present segmentation of syllables in Malay connected digit speech.
Segmentation was done in time domain signal using statistical approaches
namely the Brandt’s Generalized Likelihood Ratio (GLR) algorithm and
Divergence algorithm. These approaches basically detect abrupt changes of
energy signal in order to determine the segmentation points. Patterns used in this
experiment are connected digits of 11 speakers spoken in read mode in lab
environment and spontaneous mode in classroom environment. The aim of this
experiment is to get close match between reference points and automatic
segmentation points. Experiments were conducted to see the effect of number of
the auto regressive model order p and sliding window length L in Brandt’s
algorithm and Divergence algorithm in giving better match of the segmentation
points. This paper reports the finding of segmentation experiment using four
criterions ie. the insertion, omissions, accuracy and segmentation match between
the algorithms. The result shows that divergence algorithm performed only
slightly better and has opposite effect of the testing parameter p and L compared
to Brandt’s GLR. Read mode in comparison to spontaneous mode has better
match and less omission but less accuracy and more insertion. |
| |
|
| |
|
| |
| 1 |
Abas Lutfi. “Linguistik Deskriptif Nahu”. Dewan Bahasa dan Pustaka, Kuala Lumpur: pp. 10 - 20 (1971) |
|
|
| 2 |
H.N Ting, Y. Jasmy and S.H Salleh. “Malay Syllable Recognition Using Neural Network”. In Proceeding of the International Student Conference on Research and Development, SCOReD, Kuala Lumpur, 2006. |
|
|
| 3 |
Md Sah Hj Salam and Mohamad Nasir Said Ibrahin. “An Initial Experiment on Syllable Based Approach For Malay Digit Recognition”. In Proceeding of Advance Technology Congress. ATC2003, Putrajaya, Malaysia 2003. |
|
|
| 4 |
J.L Rouas, J. Farinas, F. Pellegrino and R.A Obrecht, “Rhythmic Unit Extraction and Modeling for Automatic Language Identification”. Speech Communication 47: 436-456. 2005. |
|
|
| 5 |
S. Jarifi, D. Pastor and O. Rosec,. “Brandt’s GLR Method & Refined HMM Segmentation for TTS Synthesis Application”. In Proceeding of European Signal Processing Conference, EUSIPCO’2005. Antalya,Turkey. 2005 |
|
|
| 6 |
B. Michele and V.N. Igor, “Detection of Abrupt Changes: Theory and Application”, Prentice Hall, INC. USA 1993 |
|
|
| 7 |
R.A. Obrecht, “Automatic Segmentation of Continuous Speech Signal”, IEEE Trans. Acoustic, Speech and Signal Processing, vol ASSP-36(1). pp 29-40, 1988 |
|
|
| 8 |
T. Jehan T. “Musical Signal Parameter Estimation”. Master Thesis, University of Rennes, France. 1997 |
|
|
| 9 |
O. Engstrand, “Sytematicity of phonetic variation in natural discourse”. Speech Communication 11, pp. 337-346. 1992 |
|
|
| 10 |
K. Kohler “Segmental reduction in connected speech in German: Phonological facts and phonetics explaination”. Speech Production and Speech Modelling, Kluwer, Dordrecht. pp.69-92. 1990 |
|
|
| 11 |
P. Cosi, J.P. Hosom, and F. Tesser. “High performance Italian continuous digit recognition”, In Proceedings of International Conference on Spoken Language Processing, Beijing, China, ICSLP 2000. |
|
|
| 12 |
Language Production and Perception, online : http://www.ling.upenn.edu/courses/Fall_1998/ling001/production_perception.html. pp. 1 – 11. |
|
|
| 13 |
T. Nuttakorn and K. Boonserm. ” A syllable - based connected Thai digit speech recognition using neural network and duration modeling”. In Proceeding of The 1999 IEEE International Symposium on Intelligent Signal Processing and Communication. Pp 785-788. 1999. |
|
|
| 14 |
W. Wei .and and S.V. Vuuren. “Improved neural Network Training of Inter-Word Context Units for Connected Digit recognition”, In Proceeding of IEEE International Conf. on Acoustics, Speech & Signal Processing, Seattle, ICASSP 1998 |
|
|
| 15 |
Md Sah Hj Salam , Dzulkifli Mohamad dan S-H Salleh.” Speech Anticipation via Genetic Optimization: An Experiment on Simulated Data”, In Proceeding of International .Conference on Artificial Intelligence in Engineering and Technology, ICAIET ’06, Kota Kinabalu, Sabah, Malaysia.2006. |
|
|
| 16 |
L.R Rabiner and M.R Sambur. “Some Preliminary Experiments in the Recognition of Connected Digits”. IEEE Trans. Acoustic, Speech and Signal Processing, vol ASSP-24. pp 170-182 April 1976. |
|
|
| |
|
| |
|
| |
| 1 |
A. V. Aquino and Y. J. A. Barria, “Change Detection in Time Series Using the Maximal Overlap Discrete Wavelet Transform”. Lat. Am. appl. res. 39(2), pp. 145-152., 2009. |
|
|
| 2 |
H. Farsi, “Improvement of Minimum Tracking in Minimum Statistics Noise Estimation Method”, Signal Processing: An International Journal (SPIJ), 4(1), pp. 17 – 22, 2010. |
|
|
| 3 |
M. S Salam, D. Mohamad and S. H Salleh, “Insertion Reduction in Speech Segmentation Using Neural Network”, in Information Technology, 2008. ITSim International Symposium, Kuala Lumpur, 26-28 Aug. 2008, pp.no. 1-7. |
|
|
| 4 |
R. Sabah and R.N. Ainon, “Isolated Digit Speech Recognition in Malay Language Using Neuro-Fuzzy Approach”, in Modelling & Simulation, AMS '09. Third Asia International Conference , Bali, 25-29 May 2009, pp.no. 336 – 340. |
|
|
| |
|
| |
|
| |
| 1 |
Department of Computer Graphics & Multimedia - Universiti Teknologi Malaysia (UTM) |
| 2 |
Universiti Teknologi Malaysia (UTM) Repository |
| |
|
| |
|
| |
|
| M-S Salam : Colleagues
|
|
| Dzulkifli Mohamad : Colleagues
|
|
| S-H Salleh : Colleagues
|
|