Home   >   CSC-OpenAccess Library   >    Manuscript Information
Spatialization Parameter Estimation in MDCT Domain for Stereo Audio
Suresh K, Akhil Raj R
Pages - 66 - 78     |    Revised - 30-11-2015     |    Published - 31-12-2015
Volume - 9   Issue - 5    |    Publication Date - November / December 2015  Table of Contents
MORE INFORMATION
KEYWORDS
Parametric Audio Coding, MDCT, Parametric Stereo.
ABSTRACT
For representing multi-channel audio at low bit rate parametric coding techniques are used in many audio coding standards. An MDCT domain parametric stereo coding algorithm which represents the stereo channels as the linear combination of the ‘sum’ channel derived from the stereo channels and a reverberated channel generated from the ‘sum ’channel has been reported in literature. This model is inefficient in capturing the stereo image since only four parameters per sub-band is used as spatialization parameters. In this work we improve this MDCT domain parametric coder with an augmented parameter extraction scheme using an additional reverberated channel. We further modify the scheme by using orthogonalized de-correlated channels for analysis and synthesis of parametric stereo. A synthesis scheme with perceptually scaled parameter set is also introduced. Finally we present, subjective evaluation of the different parametric stereo schemes using MUSHRA test and the increased the perceptual audio quality of the synthesized signals are evident from these test results.
1 Google Scholar 
2 CiteSeerX 
3 refSeek 
4 Scribd 
5 SlideShare 
6 PdfSR 
A. Kohlrausch, “Auditory filter shape derived from binaural masking experiments," J. Acous. Soc. America, vol. 84, no. 2, pp. 573-583, 1988. 16
B. R. Glasberg and B.C.J. Moore, “Derivation of auditory filter shapes from notched-noise data," Hearing Research, vol. 47, no. 1-2, pp . 103-138, 1990.
C. Faller, “Parametric Coding of Spatial Audio,” Swiss Federal Institute of Technology Lausanne (EPFL), PhD Thesis, No. 3062, 2004.
C. Faller, “Parametric Multichannel Audio Coding: Synthesis of Coherence Cues," IEEE Trans. Speech and Audio Proc., vol. 14, No. 1, pp. 1-12, Jan. 2006.
Christian R. Helmrich, Pontus Carlsson, Sascha Disch, Bernd Edler, Johannes Hilpert, Matthias Neusinger, Heiko Purnhagen, Nikolaus Rettelbach, Julien Robilliard, and Lars Villemoes, “Efficient Transform Coding Of Two-Channel Audio Signals By Means Of Complex-Valued Stereo Prediction,” in Proc. IEEE ICASSP-2011, pp. 497-500, 2011.
Christof Faller, and Frank Baumgarte, “Binaural Cue Coding: A Novel and Efficient Representation of Spatial Audio,” in Proc. IEEE ICASSP-2002, vol: 2, pp. II-1841 - II-1844, 2002.
D. Yang, H. Ai, C. Kyriakakis, ans C.C. J. Kuo, “An inter channel redundancy removal approach for high quality multichannel audio compression,” in AES convention, Los Angeles, CA, Sept 2000.
F. Baumgarte, and C. Faller,“Binaural Cue Coding-part I : Psychoacoustic fundamentals and Design Principles,” in IEEE Trans. on Speech and Audio Proc., vol. 11, No. 6, pp. 509-519, June 2003.
F. Baumgarte, and C. Faller,“Binaural cue coding-part II : Schemes and applications,” in IEEE Trans. on Speech and Audio Proc., vol. 11, No. 6, pp. 520-531, June 2003.
ITU/ITU-R BS 1534. Method for subjective assessment of intermediate quality level of coding systems, 2001.
J. Breebaart, et al.,“Parametric Coding of Stereo Audio,” in EURASIP Journal on Applied Signal Processing, vol 2005, No. 9, pp 1305 - 1322, June 2005.
J. Herre, et.al, “The reference Model Architecture for MPEG Spatial Audio Coding," in 118th AES convention, Barcelona, Spain May 2005, Preprint 6447.
J.D. Johnston, and A.J. Ferreira, “Sum Difference Stereo Transform Coding,” in Proc. IEEE ICASSP-92, San Francisco, vol. 2, pp. 569-572, March 1992.
K Suresh and T. V. Sreenivas, “Direct MDCT Domain Psychoacoustic Modeling”, IEEE International Symposium on Signal Processing and Information Technology, pp. 742-747, December 2007.
K. Suresh and T. V. Sreenivas, “Linear Filtering in DCT-IV/DST-IV and MDCT/MDST Domain”, Signal Processing, vol 89, Issue 6, pp 1081-1089, June 2009.
K. Suresh, and T. V. Sreenivas, “MDCT Domain Analysis and Synthesis of Reverberation for Parametric Stereo Audio,” in AES 123th Convention, 2007 October 5-8, New York.
K. Suresh, and T. V. Sreenivas, “Parametric stereo coder with only MDCT domain computations,” IEEE International Symposium on Signal Processing and Information Technology, pp. 61-64, December 2009.
S. Kuo and J.D. Johnston, “A Study of Why Cross Channel Prediction is Not Applicable to Perceptual Audio Coding," IEEE Sig. Proc. Letters, vol. 8, No. 9, pp 245-247, Sep. 2001.
T. Painter, and A. Spanias, “Perceptual Coding of Digital Audio", Proc. IEEE, vol. 88, no 4, pp. 451-513, 2000.
Dr. Suresh K
Department of Electronics & Communication Government Engineering College Wayanad, Kerala, India, 670644 - India
suresh.kumaraswamy@gmail.com
Mr. Akhil Raj R
Department of Electronics & Communication College of Engineering, Thiruvananthapuram Kerala, India, 695016 - India