Home   >   CSC-OpenAccess Library   >    Manuscript Information
Full Text Available

(432.65KB)
This is an Open Access publication published under CSC-OpenAccess Policy.
Filter Bank Energy Based Malayalam Speech Segmentation and Recognition
Primekumar K.P, Sumam Mary Idiculla
Pages - 1 - 7     |    Revised - 31-12-2016     |    Published - 31-01-2017
Volume - 8   Issue - 1    |    Publication Date - February 2017  Table of Contents
MORE INFORMATION
KEYWORDS
Speech Segmentation, Filter Bank Energy, MFCC, Probabilistic Neural Networks, Hidden Markov Models.
ABSTRACT
Even though speech recognition technologies have made substantial progress, LVSR and vocabulary independent systems have not yet attained sufficient accuracy levels. For vocabulary independent speech recognition systems, segmentation of speech signal in to its constituent units such as phonemes, syllables is necessary. This paper presents a method of segmentation of spoken Malayalam words in to its constituent syllables and analyses the classification accuracy using PNN and HMM. Variations in peak filter bank energy is used for modeling criteria for segmentation. Mel Frequency Cepstral Coefficients (MFCC) and energy in each frame is used to extract the resultant feature vector in the feature extraction stage. A semi-automatic method is used for labeling the speech segments in the training phase. The system is trained using 30 samples of 26 syllables semi automatically segmented from fifty words collected from a male and female and tested on another set of fifty words containing 4720 syllables gives maximum accuracy of 74.7% and 66.77% for male and female respectively.
CITED BY (0)  
1 Google Scholar
2 CiteSeerX
3 BibSonomy
4 Doc Player
5 Scribd
6 SlideShare
7 PdfSR
1 Krishnan, V.R ; V. Jayakumar A, Anto P.B (2008) ,"Speech Recognition of isolated Malayalam words using wavelet features and Artificial Neural network, Fourth IEEE International symposium on Electronic Design, Test and Applications, 2008 volume Issue 23-25 Jan, 2008. Page(s) 240 - 243.
2 Cinikurian and Kannan Balakrishnan, "Continuous Speech Recognition System for Malayalam Language using PLP Cepstral Coefficient, IJCBR, Vol3, Issue1, Jan2012.
3 S. Young, "A review of large vocabulary continues speech recognition,"Proc.IEEE Sig. Processing. Mag. September1996, 45-57
4 Lawrence R. Rabiner. "A tutorial on HMMs and selected applications in speech recognition". Proceedings of IEEE, Vol77, No2, Feb1989.
5 Rudi Villing, Joseph Timoney, Tomas Ward and John Costello, Automatic Blind Syllable Segmentation for Continuous Speech, ISSC 2004, Belfast.
6 K.F. Chow, Tan Lee and P.C Ching, "Sub syllable Acoustic Modelling for Cantonese Speech Recognition"
7 Kaichiro Hatazaki, Yasuhiro Komori, Takeshi Kawabata and Kiyohiro Shikano, "Phoneme segmentation using spectrogram reading knowledge", IEEE,1989.
8 Md. Mijanur Rahman, Md Al-Amin Bhuiyan, "Continuous Bangla Speech Segmentation using Short-term Speech Features Extraction Approaches",IJACSA, Vol3, No11, 2012.
9 Dzmitry Pekar and Siarhei Tsikhanenka, "Speech segmentation algorithm based on an analysis of the normalized Power Spectral Density", 2010
10 Prasad, V.K nagarajan T and Murthy H.A "Automatic segmentation of continuous speech using minimum phase group delay functions", Vol.42, Apr2004, pp 1883-1886.
11 Aravind Ganapathiraju, Jonathan Hamaker, Joseph Picone, Mark Ordowski and George R Dddington, "Syllable -Based Large Vocabulary Continuous Speech Recognition", IEEE Transactions on Speech and Audio Processing, Vol9, No4, May2001.
12 Fu-Hua, Richard M Stern, Xuedong Huang,Alejandro Acero, "Efficient cepstral normalization for robust speech recognition, human language technology", 1993
13 Sergios Theodoridis and Konstantinos Koutroumbas, "Pattern Recognition", Fourth Edition
14 Marko Kos, Matej Grasic, Zdravko Kacic, " Online Speech/Music Segmentation Based on the Variance Mean of Filter Bank Energy" ,2009
15 Lawrence R. Rabiner , Biing Hwang Juans."Fundamentals of speech recognition", Pearson Education.
16 D.f specht, Probabilistic Neural Networks, neural Networks, Vol3,pp109- 118,1990.
Mr. Primekumar K.P
Cochin University of Science and Technology - India
primekumar@rediffmail.com
Mr. Sumam Mary Idiculla
Department of Computer Science Cochin University of Science and Technology Kochi, 682022, India - India