This is an Open Access publication published under CSC-OpenAccess Policy.
Spatialization Parameter Estimation in MDCT Domain for Stereo Audio
Suresh K, Akhil Raj R
Pages - 66 - 78     |    Revised - 30-11-2015     |    Published - 31-12-2015
Volume - 9   Issue - 5    |    Publication Date - November / December 2015
KEYWORDS
Parametric Audio Coding, MDCT, Parametric Stereo.
ABSTRACT
Parametric coding techniques are used in many audio coding standards to represent multi-channel audio at low bit rates. An MDCT domain parametric stereo coding algorithm has been reported in the literature that represents the stereo channels as a linear combination of the ‘sum’ channel derived from the stereo channels and a reverberated channel generated from the ‘sum’ channel. This model is inefficient in capturing the stereo image, since only four parameters per sub-band are used as spatialization parameters. In this work we improve this MDCT domain parametric coder with an augmented parameter extraction scheme that uses an additional reverberated channel. We further modify the scheme by using orthogonalized de-correlated channels for analysis and synthesis of parametric stereo. A synthesis scheme with a perceptually scaled parameter set is also introduced. Finally, we present a subjective evaluation of the different parametric stereo schemes using the MUSHRA test; the results show increased perceptual audio quality of the synthesized signals.
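As an illustration only, the linear-combination model in the abstract can be sketched as a per-sub-band least-squares fit. This is not the authors' algorithm: the function names, the definition of the sum channel as the average of left and right, the band edges, and the treatment of the de-correlated channel as a ready-made input are all assumptions made for the sketch. It fits each stereo channel as a weighted sum of the sum channel and one orthogonalized de-correlated channel, which is one reading of "four parameters per sub-band".

```python
def dot(x, y):
    """Inner product of two equal-length coefficient lists."""
    return sum(a * b for a, b in zip(x, y))

def orthogonalize(s, d):
    """Gram-Schmidt step: remove from d its projection onto s."""
    ss = dot(s, s)
    if ss == 0.0:
        return list(d)
    g = dot(d, s) / ss
    return [di - g * si for si, di in zip(s, d)]

def band_weights(target, s, d_orth):
    """Least-squares weights (a, b) such that target ~ a*s + b*d_orth.

    Because s and d_orth are orthogonal, each weight reduces to an
    independent projection of the target onto one basis signal.
    """
    ss = dot(s, s)
    dd = dot(d_orth, d_orth)
    a = dot(target, s) / ss if ss else 0.0
    b = dot(target, d_orth) / dd if dd else 0.0
    return a, b

def estimate_parameters(left, right, decor, band_edges):
    """Return one tuple (aL, bL, aR, bR) per sub-band.

    left, right, decor: MDCT coefficients of one frame (hypothetical inputs).
    band_edges: coefficient indices delimiting the sub-bands (hypothetical).
    """
    params = []
    for lo, hi in zip(band_edges[:-1], band_edges[1:]):
        l, r, d = left[lo:hi], right[lo:hi], decor[lo:hi]
        # Assumed definition of the 'sum' channel: average of left and right.
        s = [0.5 * (x + y) for x, y in zip(l, r)]
        d_orth = orthogonalize(s, d)
        params.append(band_weights(l, s, d_orth) + band_weights(r, s, d_orth))
    return params
```

Under these assumptions, a decoder would rebuild each band of the left channel as `aL * s + bL * d_orth`, and likewise for the right channel. Adding a second de-correlated channel, as the augmented scheme describes, would add one more orthogonalized basis signal and one more weight per channel.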
Dr. Suresh K
Department of Electronics & Communication, Government Engineering College Wayanad, Kerala 670644, India
suresh.kumaraswamy@gmail.com
Mr. Akhil Raj R
Department of Electronics & Communication, College of Engineering, Thiruvananthapuram, Kerala 695016, India