DocumentCode
2334179
Title
Frequency-Warping Invariant Features for Automatic Speech Recognition
Author
Mertins, Alfred ; Rademacher, Jan
Author_Institution
Dept. of Phys., Oldenburg Univ.
Volume
5
fYear
2006
fDate
14-19 May 2006
Abstract
Based on the well-known relationship between vocal tract length (VTL) variation and linear frequency warping, we present a method for generating vocal tract length invariant (VTLI) features. These features are computed as translation invariant, correlation-type features in a log-frequency domain. In phoneme classification and recognition experiments on the TIMIT database, their discrimination capabilities and robustness to mismatches between training and test conditions turned out to be considerably better than for Mel-frequency cepstral coefficients (MFCCs). The best results are obtained when VTLI features and MFCCs are combined
Keywords
cepstral analysis; correlation methods; frequency-domain analysis; signal classification; speech recognition; wavelet transforms; Mel-frequency cepstral coefficients; TIMIT database; automatic speech recognition; correlation-type features; discrimination capabilities; frequency-warping invariant features; linear frequency warping; log-frequency domain; phoneme classification; recognition experiments; translation invariant; vocal tract length variation; Automatic speech recognition; Bandwidth; Continuous wavelet transforms; Discrete wavelet transforms; Fourier transforms; Frequency; Hidden Markov models; Robustness; Testing; Wavelet transforms;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location
Toulouse
ISSN
1520-6149
Print_ISBN
1-4244-0469-X
Type
conf
DOI
10.1109/ICASSP.2006.1661453
Filename
1661453
Link To Document