DocumentCode
3549601
Title
A bilinear transform approach for vocal tract length normalization
Author
Wang, Xu ; Bing-Xi, Wang ; Qi, Ding
Author_Institution
Inf. Eng. Univ., Henan, China
Volume
1
fYear
2004
fDate
6-9 Dec. 2004
Firstpage
547
Abstract
We have developed and evaluated a set of speaker normalization procedures derived by bilinear transform (BLT) to compensate for variations in vocal tract lengths of different classes of speakers. The warping factors are estimated using the average third formants and their bandwidth, leaving out the exhaustive search. The MFCC of the testing data are transformed by the warped Mel filterbanks to match the models of the training data. The effectiveness of this set of speaker normalization procedures is examined in an experimental study performed using an isolated digit database of man, woman and children comparing to other standard speaker normalization method. The results of experiments demonstrate their capacity to achieve recognition accuracy increase of 19.5% and 16.5% at the best.
Keywords
channel bank filters; speaker recognition; speech processing; transforms; Mel filterbank; bilinear transform; speaker normalization procedure; speech recognition; vocal tract length normalization; warping factors; Bandwidth; Databases; Feature extraction; Iterative methods; Matched filters; Mel frequency cepstral coefficient; Piecewise linear techniques; Speech recognition; Testing; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Control, Automation, Robotics and Vision Conference, 2004. ICARCV 2004 8th
Print_ISBN
0-7803-8653-1
Type
conf
DOI
10.1109/ICARCV.2004.1468885
Filename
1468885
Link To Document