List of Journals    /    Call For Papers    /    Subscriptions    /    Login
 
 
 
 
 SEARCH
By Author By Title
 
 
ABOUT CSC
 About CSC Journals
 CSC Journals Objectives
 List of Journals
 CALL FOR PAPERS
 Call For Papers CFP
 Special Issue CFP
AUTHOR GUIDELINES
 Submission Guidelines
 Peer Review Process
 Helpful Hints For Getting Published
 Plagiarism Policies
 Abstracting & Indexing
 Open Access Policy
 Submit Manuscript
 FOR REVIEWERS
 Reviewer Guidelines
 FOR EDITORIAL
 Editor Guidelines
 Join Us As Editor
 Launch Special Issue
 Suggest New Journal
 CSC LIBRARY
 Browse CSC Library
 Open Access Policy
  SERVICES
 Conference Partnership Program (CPP)
 Abstracting & Indexing
 SUBSCRIPTIONS
 Subscriptions
 Discounted Packages
 Archival Subscriptions
 How to Subscribe
 Librarians
 Subscriptions Agents
 Order Form
 DOWNLOADS
 
 
 
 
A Combined Voice Activity Detector Based On Singular Value Decomposition and Fourier Transform
Full text
 PDF(150.8KB)
Source 
Signal Processing: An International Journal (SPIJ)
Table of Contents
Download Complete Issue    PDF(1.86MB)
Volume:  4    Issue:  1
Pages:  1-67
Publication Date:   March 2010
ISSN (Online): 1985-2339
Pages 
54 - 61
Author(s)  
 
Published Date   
07-04-2010 
Publisher 
CSC Journals, Kuala Lumpur, Malaysia
ADDITIONAL INFORMATION
Keywords   Abstract   References   Cited by   Related Articles   Collaborative Colleague
 
KEYWORDS:   Speech, Voice Activity Detector, Singular Value 
 
 
This Manuscript is indexed in the following databases/websites:-
1. Scribd
2. Directory of Open Access Journals (DOAJ)
3. Directory of Open Access Journals (DOAJ)
4. PDFCAST
5. Docstoc
6. ResearchGATE
7. refSeek
8. Academic Index
9. Socol@r
10. iSEEK
 
 
voice activity detector (VAD) is used to separate the speech data included parts from silence parts of the signal. In this paper a new VAD algorithm is represented on the basis of singular value decomposition. There are two sections to perform the feature vector extraction. In first section voiced frames are separated from unvoiced and silence frames. In second section unvoiced frames are silence frames. To perform the above sections, first, windowing the noisy signal then Hankel’s matrix is formed for each frame. The basis of statistical feature extraction of purposed system is slope of singular value curve related to each frame by using linear regression. It is shown that the slope of singular values curve per different SNRs in voiced frames is more than the other types and this property can be to achieve the goal the first part can be used. High similarity between feature vector of unvoiced and silence frame caused to approach for separation of the two categories above cannot be used. So in the second part, the frequency characteristics for identification of unvoiced frames from silent frames have been used. Simulation results show that high speed and accuracy are the advantages of the proposed system. 
 
 
 
1 J. Ramirez, J. C. Segura, C. Benitez, A. de la Torre, A. J. Rubio, “A new Kullback-Leibler VAD for speech recognition in noise”, IEEE Signal Processing Letters, 11(2) :266– 269. 2004.
2 A. Kondoz, “Digital speech: coding for low bit rate communication systems”, J. Wiley, New York, 1994.
3 Y. Ghanbari, M. R. Karami Mollaei, “A new approach for speech enhancement based on the adaptive thresholding of the wavelet packets”, Speech Communication 48 (2006) 927–940.
4 B. V. Harsha, “A Noise Robust Activity Detection Algorithm”, proc. Of int. symposium of intelligent multimedia, video and speech processing, pp. 322-325, Oct. 2004, Hong Kon.
5 J. Ram?rez, J. C. Segura, C. Ben?tez, A. de la Torre, A. Rubio, “A New Adaptive Long-Term Spectral Estimation Voice Activity Detector,” EUROSPEECH, pp. 3041-3044, 2003, Geneva.
6 J. Faneuff, “ Spatial, Spectral, and Perceptual Nonlinear Noise Reduction for Hands-free Microphones in a Car,” Master Thesis Electrical and Computer Engineering July 2002.
7 S. Skorik, F. Berthommier, “On a Cepstrum-Based Speech Detector Robust To White Noise,” Accepted for Specom 2000, St. Petersbourg.
8 J. Stegmann, G. Schroeder, “Robust Voice Activity Detection Based on the Wavelet Transform", Proc. IEEE Workshop on Speech Coding, Sep. 1997, pp. 99-100, Pocono Manor, Pennsylvania, USA.
9 J.A. Haigh and J.S. Mason, “Robust Voice Activity Detection Using Cepstral Features,” In Proc. of IEEE TENCON'93, vol. 3, pp. 321-324, 1993, Beijng.
10 J. Shaojun, G. Hitato, Y. Fuliang, “A New Algorithm For Voice Activity Detection Based On Wavelet Transform,” proc. Of int. symposium of intelligent multimedia, video and speech processing, pp. 222-225, Oct. 2004, Hong Kong.
11 Tanyer S G, Ozer H, “ Voice activity detection in nonstationary guassian noise,” proceeding of ICSP’98 pp.1620-1623.
12 Sangwan, A., Chiranth, M. C., Jamadagni, H. S., Sah, R.,Prasad, R. V., Gaurav, V., “VAD Techniques for Real-Time Speech Transmission on the Internet”, 5th IEEE International Conference on High-Speed Networks and Multimedia Communications, pp. 46–50, 2002.
13 L. R. Rabiner, M. R. Sambur, “ Application of an LPC Distance Measure to the Voiced- Unvoiced-Silence Detection Problem,” IEEE Trans. on Acoustics, Speech, and Signal Processing, Vol. ASSP-25, No. 4, pp. 338-343, August 1977.
14 A. Vahatalo, I. Johansson, “Voice Activity Detection For GSM Adaptive Multi-Rate Codec,” IEEE 1999, pp. 55-57.
15 M.Orlandi, a. santarelli, D. Falavigna, “Maximum Likelihood endpoint Detection with timedomain features,”eurospeech 2003, Geneva, pp.1757-1760.
16 S. H. Jensen, P. C. Hansen, S. D. Hansen, “Reduction of Broad-Band Noise in Speech by Truncated QSVD,” IEEE, Trans on speech & Audio Processing, Vol.3, No.6, November 1995.
17 Y. Ghanbari, M. Karami, “Spectral subtraction in the wavelet domain for speech enhancement,” International Conference on Information Knowledge Technology (IkT2004), CD ROM, 2004.
18 H. Sameti, H. Sheikhzadeh, Li Deng, R. L.Brennan, “HMM-Based Strategies for Enhancement of Speech Signals Embedded in Nonstationary Noise”, IEEE Transactions on Speech and Audio Processing, Vol. 6, No. 5, September 1998.
19 S. Sayeed, N. S. Kamel, R. Besar, “A Sensor-Based Approach for Dynamic Signature Verification using Data Glove”. Signal Processing: An International Journal (SPIJ), 2(1)1:10, 2008.
20 J.Ghasemi, M. R. Karami Mollaei, "A New Approach for Speech Enhancement Based On Eigenvalue Spectral Subtraction" Signal Processing: An International Journal, (SPIJ) 3(4), 34:41, 2009.
21 A. Afzalian, M.R. Karami Mollaei, J. Ghasemi, " A New Approach for Speech Enhancement Based On Singular Value Decomposition and Wavelet Transform", AJBAS In Press(2010).
 
 
 
 
 
 
 
 
Jamal Ghasemi : Colleagues
Amard Afzalian : Colleagues
M.R. Karami Mollaei : Colleagues  
 
 
 
  Untitled Document
 
Copyrights (c) 2012 Computer Science Journals. All rights reserved.
Best viewed at 1152 x 864 resolution. Microsoft Internet Explorer.
 
  
 
Copyrights & Usage: Articles published by CSC Journals are Open Access. Permission to copy and distribute any other content, images, animation and other parts of this website is prohibited. CSC Journals has the rights to take action against individual/group if they are found victim of copying these parts of the website.