DocumentCode :
310635
Title :
Perceptual speech coding using time and frequency masking constraints
Author :
Garnero, B. ; Drygajlo, Andrzej
Author_Institution :
Signal Process. Lab., Swiss Fed. Inst. of Technol., Lausanne, Switzerland
Volume :
2
fYear :
1997
fDate :
21-24 Apr 1997
Firstpage :
1363
Abstract :
This paper presents a new wide-band speech coding system based on a fast wavelet packet transform algorithm as well as a formulation of temporal and spectral psychoacoustic models of masking. The proposed FFT-like overlapped block orthogonal transform allows us to approximate the auditory critical band decomposition in an efficient manner, which is a major advantage over previous approaches that used uniform filter banks. As a result of such a decomposition, the perceptually tuned time-frequency structure of the original speech signal is preserved. This allows us to make use of the temporal and spectral properties of the human auditory system to decrease the average bit rate of the encoder, while perceptually hiding the quantization error
Keywords :
hearing; spectral analysis; speech coding; speech intelligibility; speech processing; time-frequency analysis; transform coding; wavelet transforms; FFT; auditory critical band decomposition; average bit rate reduction; fast wavelet packet transform algorithm; frequency masking constraints; human auditory system; overlapped block orthogonal transform; perceptual speech coding; perceptually tuned time-frequency structure; quantization error; spectral properties; spectral psychoacoustic models; speech signal; temporal properties; temporal psychoacoustic models; time masking constraints; wideband speech coding system; Auditory system; Bit rate; Filter bank; Humans; Psychoacoustic models; Speech coding; Time frequency analysis; Wavelet packets; Wavelet transforms; Wideband;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
ISSN :
1520-6149
Print_ISBN :
0-8186-7919-0
Type :
conf
DOI :
10.1109/ICASSP.1997.596200
Filename :
596200
Link To Document :
بازگشت