DocumentCode :
1911420
Title :
Audio compression at low bit rates using a signal adaptive switched filterbank
Author :
Sinha, Deepen ; Johnston, James D.
Author_Institution :
AT&T Bell Labs., Murray Hill, NJ, USA
Volume :
2
fYear :
1996
fDate :
7-10 May 1996
Firstpage :
1053
Abstract :
A perceptual audio coder typically consists of a filter-bank which breaks the signal into its frequency components. These components are then quantized using a perceptual masking model. Previous efforts have indicated that a high resolution filter-bank, e.g., the modified discrete cosine transform (MDCT) with 1024 subbands, is able to minimize the bit rate requirements for most of the music samples. The high resolution MDCT, however, is not suitable for the encoding of non-stationary segments of music. A long/short resolution or “window” switching scheme has been employed to overcome this problem but it has certain inherent disadvantages which become prominent at lower bit rates (<64 kbps for stereo). We propose a novel switched filter-bank scheme which switches between a MDCT and a wavelet filter-bank based on the signal characteristics. A tree structured wavelet filter-bank with properly designed filters offers natural advantages for the representation of non-stationary segments such as attacks. Furthermore, it allows for the optimum exploitation of perceptual irrelevancies
Keywords :
adaptive filters; adaptive signal processing; audio coding; band-pass filters; channel capacity; data compression; discrete cosine transforms; filtering theory; music; signal resolution; transform coding; wavelet transforms; audio compression; frequency components; high resolution MDCT; high resolution filter bank; low bit rates; modified discrete cosine transform; music samples; nonstationary music segments; perceptual audio coder; perceptual masking model; quantization; signal adaptive switched filterbank; signal characteristics; signal representation; tree structured wavelet filter bank; window switching scheme; Audio compression; Bit rate; Discrete cosine transforms; Energy states; Filter bank; Frequency; Signal design; Signal resolution; Switches; Wideband;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
ISSN :
1520-6149
Print_ISBN :
0-7803-3192-3
Type :
conf
DOI :
10.1109/ICASSP.1996.543544
Filename :
543544
Link To Document :
بازگشت