DocumentCode
312198
Title
Analysis of speech segments using variable spectral/temporal resolution
Author
Wang, Xihong ; Zahorian, Stephen A. ; Auberg, Stefan
Author_Institution
Dept. of Electr. & Comput. Eng., Old Dominion Univ., Norfolk, VA, USA
Volume
2
fYear
1996
fDate
3-6 Oct 1996
Firstpage
1221
Abstract
The authors present an approach for efficiently computing a compact temporal/spectral feature set for representing a segment of speech, with effective resolution depending on both frequency and time position within the segment. The goal is to mimic the resolution properties of the human auditory system while using a computationally efficient FFT-based front end rather than a more complex auditory model. In particular, they apply both frequency and time “warping” to FFT spectra to obtain good frequency resolution at low frequencies and good time resolution at high frequencies. Time resolution is also varied so that the center of the segment is better represented than the endpoints. The resolution can be varied by selecting “warping” functions that are controlled by a small number of parameters. The method was experimentally verified on the classification of six stops extracted from the TIMIT continuous speech database. The best classification rate obtained on test data was 81.2%, using 50 features computed with the presented method.
Keywords
fast Fourier transforms; spectral analysis; speech processing; FFT spectra; TIMIT continuous speech database; classification rate; compact spectral feature set; compact temporal feature set; computationally efficient FFT-based front end; frequency position; frequency warping; human auditory system; resolution; speech segment analysis; time position; time warping; variable spectral resolution; variable temporal resolution; warping functions; Auditory system; Data mining; Fast Fourier transforms; Frequency conversion; Humans; Signal resolution; Spectral analysis; Speech analysis; Testing; Time frequency analysis;
fLanguage
English
Publisher
IEEE
Conference_Titel
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
0-7803-3555-4
Type
conf
DOI
10.1109/ICSLP.1996.607828
Filename
607828