DocumentCode :
319590
Title :
An acoustic front-end using warped frequency and temporal resolutions
Author :
Lilly, B.T. ; Paliwal, K.K.
Author_Institution :
Sch. of Microelectron. Eng., Griffith Univ., Brisbane, Qld., Australia
Volume :
1
fYear :
1997
fDate :
4-4 Dec. 1997
Firstpage :
133
Abstract :
Typically, the power spectrum of a speech frame used in speech recognition is estimated for a fixed length window using the fast Fourier transform. Each frequency component represented in this power spectrum is an estimate over that speech frame. The power spectrum calculated in this way has a constant time and frequency resolution. An example of this type of front-end is the LPC-derived cepstral front-end commonly used is recognition systems today. The acoustic front-end presented in this paper employs both a warped frequency and temporal resolutions. We show that a front-end that utilises both warping functions, outperforms a front-end that employs only a warped frequency scale. We also show that this new front-end is unsuitable for noisy conditions.
Keywords :
acoustic signal processing; FIR bandpass filter bank; LPC-derived cepstral front-end; acoustic front-end; fast Fourier transform; fixed length window; frequency component; human auditory system; power spectrum; speech frame; speech recognition; time resolution; warped frequency resolution; warped temporal resolution; Band pass filters; Bandwidth; Cepstral analysis; Ear; Filter bank; Finite impulse response filter; Frequency estimation; Humans; Signal resolution; Speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications., Proceedings of IEEE
Conference_Location :
Brisbane, Qld., Australia
Print_ISBN :
0-7803-4365-4
Type :
conf
DOI :
10.1109/TENCON.1997.647275
Filename :
647275
Link To Document :
بازگشت