DocumentCode :
2467756
Title :
An efficient, low-complexity audio coder delivering multiple levels of quality for interactive applications
Author :
Lu, Zhitao ; Pearlman, William A.
Author_Institution :
Dept. of Electr. Comput. & Syst. Eng., Rensselaer Polytech. Inst., Troy, NY, USA
fYear :
1998
fDate :
7-9 Dec 1998
Firstpage :
529
Lastpage :
534
Abstract :
This paper proposes an efficient, low-complexity audio coder based on the SPIHT (set partitioning in hierarchical trees) coding algorithm, which has achieved notable success in still image coding. A wavelet packet transform is used to decompose the audio signal into 29 frequency subbands corresponding roughly to the critical subbands of the human auditory system. A psychoacoustic model, which, for simplicity, is based on MPEG model I, is used to calculate the signal-to-mask ratio and then the bit rate allocation among subbands. The subbands are divided into two groups: the low-frequency group, which contains the first 17 subbands corresponding to 0-3.4 kHz, and the high-frequency group, which contains the remaining subbands. The SPIHT algorithm is used to encode and decode the low-frequency group, and a reverse sorting process plus an arithmetic coding algorithm is used to encode and decode the high-frequency group. The experiment shows that this coder yields nearly transparent quality at bit rates of 55-66 kbit/s and degrades only gradually at lower rates. The low complexity of this coding system shows its potential for interactive applications with levels of quality from good to perceptually transparent.
Keywords :
arithmetic codes; audio coding; computational complexity; decoding; interactive systems; set theory; trees (mathematics); wavelet transforms; 0 Hz to 3.4 kHz; 55 to 66 kbit/s; MPEG model I; SPIHT coding algorithm; arithmetic coding algorithm; audio signal decomposition; bit rate allocation; critical subbands; decoding; experiment; frequency subbands; high frequency group; high frequency subbands; human auditory system; interactive applications; low frequency group; low-complexity audio coder; perceptually transparent quality; psychoacoustic model; quality levels; reverse sorting process; set partitioning in hierarchical trees; signal to mask ratio; still image coding; wavelet packet transform; Auditory system; Bit rate; Decoding; Frequency; Humans; Image coding; Partitioning algorithms; Psychoacoustic models; Wavelet packets; Wavelet transforms;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Multimedia Signal Processing, 1998 IEEE Second Workshop on
Conference_Location :
Redondo Beach, CA
Print_ISBN :
0-7803-4919-9
Type :
conf
DOI :
10.1109/MMSP.1998.739035
Filename :
739035