DocumentCode :
3343626
Title :
Wideband speech and audio coding using gammatone filter banks
Author :
Ambikairajah, E. ; Epps, Julien ; Lin, Lee
Author_Institution :
Sch. of Electr. Eng. & Telecommun., New South Wales Univ., Sydney, NSW, Australia
Volume :
2
fYear :
2001
fDate :
2001
Firstpage :
773
Abstract :
Considerable research attention has been directed towards speech and audio coding algorithms capable of producing high quality coded speech and audio; however, few of these use signal representations which account for temporal as well as spectral detail. This paper presents a new technique for 16 kHz wideband speech and audio coding, whereby analysis and synthesis are performed using a linear phase gammatone filter bank. The outputs of these critical band filters are processed to obtain a series of pulse trains that represent neural firing. Auditory masking is then applied to reduce the number of pulses, producing a more compact time-frequency parameterization. The critical band gains and pulse amplitudes and positions are then coded using a combination of non-uniform quantization, arithmetic coding and vector quantization. This coding paradigm produces high quality coded speech and audio, is based upon well-known models of the auditory system, is highly scalable, and has moderate complexity.
Keywords :
arithmetic codes; audio coding; channel bank filters; hearing; linear phase filters; signal representation; speech coding; speech intelligibility; speech synthesis; time-frequency analysis; vector quantisation; 16 kHz; arithmetic coding; audio analysis; audio coding algorithms; audio quality; audio synthesis; auditory masking; auditory system models; coding paradigm; critical band filters; critical band gain; linear phase gammatone filter bank; neural firing; nonuniform quantization; pulse amplitude; pulse position; pulse trains; signal representation; speech analysis; speech coding algorithms; speech quality; speech synthesis; time-frequency parameterization; vector quantization; wideband audio coding; wideband speech coding; Audio coding; Filter bank; Performance analysis; Signal representations; Signal synthesis; Speech analysis; Speech coding; Speech synthesis; Time frequency analysis; Wideband;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
ISSN :
1520-6149
Print_ISBN :
0-7803-7041-4
Type :
conf
DOI :
10.1109/ICASSP.2001.941029
Filename :
941029