DocumentCode :
394688
Title :
Efficient scalable coding of stereophonic audio by conditional quantization and estimation-theoretic prediction
Author :
Aggarwal, A. ; Ryu, Sung-Uk ; Rose, Kenneth
Author_Institution :
Dept. of Electr. & Comput. Eng., California Univ., Santa Barbara, CA, USA
Volume :
5
fYear :
2003
fDate :
6-10 April 2003
Abstract :
The standard scalable coding of stereophonic audio suffers from significant performance loss because of (1) poor prediction gain at the enhancement-layer and (2) direct requantization of the reconstruction error, which is suboptimal for the noise-mask ratio (NMR) criterion. To mitigate such performance loss, this paper proposes an integrated approach which employs two complementary techniques, namely, the estimation theoretic (ET) predictor and the conditional enhancement-layer quantizer (CELQ). The ET predictor has been shown to combine information from various sources for efficient enhancement-layer prediction, while CELQ efficiently handles scalable quantization to minimize NMR. We demonstrate that the proposed combined approach can achieve major performance gains in terms of bit rate reduction and reconstruction quality enhancement. For example, the proposed 2×16 kbit/s two layer coder achieves considerably improved reconstruction quality compared to that of the conventional 4×16 kbit/s four layer coder, despite expending only 50% of the standard scalable coder bit rate.
Keywords :
audio coding; minimisation; parameter estimation; quantisation (signal); signal reconstruction; variable rate codes; CELQ; ET predictor; NMR minimization; bit rate reduction; conditional enhancement-layer quantizer; conditional quantization; efficient scalable coding; enhancement-layer prediction; estimation theoretic predictor; estimation-theoretic prediction; performance gains; performance loss; reconstruction quality enhancement; scalable quantization; stereophonic audio; two layer coder; Audio compression; Bit rate; Nuclear magnetic resonance; Performance gain; Performance loss; Quantization; Redundancy; Scalability; Signal to noise ratio; Streaming media;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1200007
Filename :
1200007
Link To Document :
بازگشت