DocumentCode :
312198
Title :
Analysis of speech segments using variable spectral/temporal resolution
Author :
Wang, Xihong ; Zahorian, Stephen A. ; Auberg, Stefan
Author_Institution :
Dept. of Electr. & Comput. Eng., Old Dominion Univ., Norfolk, VA, USA
Volume :
2
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
1221
Abstract :
The authors present an approach for efficiently computing a compact temporal/spectral feature set for representing a segment of speech, with effective resolution depending on both frequency and time position within the segment. The goal is to mimic the resolution properties of the human auditory system, but using a computationally efficient FFT-based front end rather than a more complex auditory model. In particular, they apply both frequency and time “warping” to FFT spectra to obtain good frequency resolution at low frequencies and good time resolution at high frequencies. Time resolution is also varied so that the center of the segment is better represented than the endpoints. The resolution can be varied by selecting “warping” functions that are controlled by a small number of parameters. The method was experimentally verified on the classification of six stops extracted from the TIMIT continuous speech database. The best classification rate obtained on test data was 81.2%, using 50 features computed with the method presented.
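The warping idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the warping functions, so the exponential and hyperbolic-sine maps, the parameter names (`alpha_f`, `alpha_t`), the frame settings, and the 10 x 5 split into 50 features are all assumptions chosen for illustration. The sketch resamples a log-magnitude FFT spectrogram densely at low frequencies and near the segment center; it does not reproduce the frequency-dependent time resolution the abstract also mentions.

```python
import numpy as np

def warp_axis(n_out, n_in, alpha, center=False):
    """Return n_out fractional indices into an axis of length n_in.

    alpha > 0 is a hypothetical warping parameter (larger = less uniform).
    center=False: sample points are densest near index 0 (low frequencies).
    center=True:  sample points are densest near the middle of the axis.
    """
    u = np.linspace(0.0, 1.0, n_out)
    if center:
        v = 2.0 * u - 1.0                      # map to [-1, 1]
        w = 0.5 + 0.5 * np.sinh(alpha * v) / np.sinh(alpha)
    else:
        w = np.expm1(alpha * u) / np.expm1(alpha)
    return w * (n_in - 1)

def warped_features(signal, frame_len=256, hop=64,
                    n_freq=10, n_time=5, alpha_f=2.0, alpha_t=1.5):
    """Compute n_freq * n_time features from one speech segment (assumed layout)."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hanning(frame_len)
    frames = np.stack([signal[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])
    # Log-magnitude FFT spectra, shape (n_frames, frame_len // 2 + 1).
    spec = np.log(np.abs(np.fft.rfft(frames, axis=1)) + 1e-10)

    # Frequency warping: interpolate each frame at warped bin positions.
    bins = np.arange(spec.shape[1])
    f_idx = warp_axis(n_freq, spec.shape[1], alpha_f)
    spec_f = np.stack([np.interp(f_idx, bins, row) for row in spec])

    # Time warping: interpolate each frequency channel at warped frame positions.
    frame_axis = np.arange(n_frames)
    t_idx = warp_axis(n_time, n_frames, alpha_t, center=True)
    feats = np.stack([np.interp(t_idx, frame_axis, spec_f[:, k])
                      for k in range(n_freq)], axis=1)
    return feats.ravel()                       # length n_time * n_freq

# Usage on synthetic data (a stand-in for a real speech segment).
rng = np.random.default_rng(0)
segment = rng.standard_normal(2048)
features = warped_features(segment)
```

With the default arguments the returned vector has 50 entries. Changing `alpha_f` or `alpha_t` shifts how strongly the sample points cluster, which is one simple way to realize "a small number of parameters" controlling resolution.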
Keywords :
fast Fourier transforms; spectral analysis; speech processing; FFT spectra; TIMIT continuous speech database; classification rate; compact spectral feature set; compact temporal feature set; computationally efficient FFT-based front end; frequency position; frequency warping; human auditory system; resolution; speech segment analysis; time position; time warping; variable spectral resolution; variable temporal resolution; warping functions; Auditory system; Data mining; Fast Fourier transforms; Frequency conversion; Humans; Signal resolution; Spectral analysis; Speech analysis; Testing; Time frequency analysis;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607828
Filename :
607828