DocumentCode :
3406909
Title :
A frequency warping approach for vocal tract length normalization
Author :
Qi, Ding ; Wang, Xu ; Bingxi, Wang
Author_Institution :
Inf. Eng. Univ., Henan, China
Volume :
1
fYear :
2004
fDate :
31 Aug.-4 Sept. 2004
Firstpage :
691
Abstract :
A method of vocal tract length normalization (VTLN) is proposed. It uses bilinear transform (BLT) to modify the filterbank in Mel-frequency cepstrum based on the average third formant F3. The effectiveness of this method is examined on vowel and isolated digit recognitions. The baseline vowel recognition models are trained on males data and the baseline isolated digit models are trained on adult men´s data respectively. When the MFCC coefficients of test data are transformed by BLT, the recognition accuracy of females´ vowels is improved by 11.67% and the recognition accuracies of adult women and children´s isolated digits are improved by 19.5% and 13% respectively.
Keywords :
channel bank filters; speaker recognition; speech synthesis; transforms; Mel-frequency cepstrum; baseline vowel recognition; bilinear transform; frequency warping; speech recognition; vocal tract length normalization; Acoustics; Filter bank; Frequency estimation; Loudspeakers; Low pass filters; Mel frequency cepstral coefficient; Piecewise linear techniques; Speech processing; Speech recognition; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing, 2004. Proceedings. ICSP '04. 2004 7th International Conference on
Print_ISBN :
0-7803-8406-7
Type :
conf
DOI :
10.1109/ICOSP.2004.1452757
Filename :
1452757
Link To Document :
بازگشت