DocumentCode
1871254
Title
Speech analysis and coding using a multi-resolution sinusoidal transform
Author
Anderson, David V.
Author_Institution
Sch. of Electr. & Compu. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
Volume
2
fYear
1996
fDate
7-10 May 1996
Firstpage
1037
Abstract
The sinusoidal transform, as developed by Quatieri and McAulay (1986), provides a sparse representation for speech signals by taking advantage of psychoacoustic masking. The currently reported work takes the sinusoidal transform one step further by considering the frequency resolution abilities of the human auditory system in more detail. The new transform is based on the wavelet principle of variable resolution in time/frequency analysis. Specifically, a sinusoidal transform is developed which uses quadrature mirror filter (QMF) banks to obtain better time resolution at high frequencies and better frequency resolution at low frequencies. This naturally provides a perceptually improved allocation of the sinusoids. The new transform matches the human auditory system better than its predecessor and it also matches speech signals well, both fricative sounds and voiced speech. The QMF based ST is then shown to be equivalent to a more efficient FFT based implementation
Keywords
band-pass filters; discrete Fourier transforms; filtering theory; hearing; quadrature mirror filters; signal representation; signal resolution; speech coding; speech intelligibility; speech processing; time-frequency analysis; transform coding; wavelet transforms; DFT; QMF banks; frequency resolution; fricative sounds; high frequencies; human auditory system; low frequencies; multiresolution sinusoidal transform; psychoacoustic masking; quadrature mirror filter banks; sparse representation; speech analysis; speech coding; speech signals; time resolution; time/frequency analysis; variable resolution; voiced speech; wavelet transforms; Auditory system; Frequency; Humans; Mirrors; Psychology; Signal resolution; Speech analysis; Speech coding; Wavelet analysis; Wavelet transforms;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location
Atlanta, GA
ISSN
1520-6149
Print_ISBN
0-7803-3192-3
Type
conf
DOI
10.1109/ICASSP.1996.543301
Filename
543301
Link To Document