DocumentCode :
1871254
Title :
Speech analysis and coding using a multi-resolution sinusoidal transform
Author :
Anderson, David V.
Author_Institution :
Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
Volume :
2
fYear :
1996
fDate :
7-10 May 1996
Firstpage :
1037
Abstract :
The sinusoidal transform, as developed by Quatieri and McAulay (1986), provides a sparse representation for speech signals by taking advantage of psychoacoustic masking. The present work takes the sinusoidal transform one step further by considering the frequency resolution abilities of the human auditory system in more detail. The new transform is based on the wavelet principle of variable resolution in time/frequency analysis. Specifically, a sinusoidal transform is developed that uses quadrature mirror filter (QMF) banks to obtain better time resolution at high frequencies and better frequency resolution at low frequencies. This naturally provides a perceptually improved allocation of the sinusoids. The new transform matches the human auditory system better than its predecessor, and it also matches speech signals well, including both fricative sounds and voiced speech. The QMF-based sinusoidal transform (ST) is then shown to be equivalent to a more efficient FFT-based implementation.
Keywords :
band-pass filters; discrete Fourier transforms; filtering theory; hearing; quadrature mirror filters; signal representation; signal resolution; speech coding; speech intelligibility; speech processing; time-frequency analysis; transform coding; wavelet transforms; DFT; QMF banks; frequency resolution; fricative sounds; high frequencies; human auditory system; low frequencies; multiresolution sinusoidal transform; psychoacoustic masking; quadrature mirror filter banks; sparse representation; speech analysis; speech coding; speech signals; time resolution; time/frequency analysis; variable resolution; voiced speech; wavelet transforms; Auditory system; Frequency; Humans; Mirrors; Psychology; Signal resolution; Speech analysis; Speech coding; Wavelet analysis; Wavelet transforms;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
ISSN :
1520-6149
Print_ISBN :
0-7803-3192-3
Type :
conf
DOI :
10.1109/ICASSP.1996.543301
Filename :
543301