Title :
A bilinear transform approach for vocal tract length normalization
Author :
Wang, Xu ; Bing-Xi, Wang ; Qi, Ding
Author_Institution :
Inf. Eng. Univ., Henan, China
Abstract :
We have developed and evaluated a set of speaker normalization procedures derived from the bilinear transform (BLT) to compensate for variations in vocal tract length among different classes of speakers. The warping factors are estimated from the average third formants and their bandwidths, which avoids an exhaustive search. The MFCCs of the test data are computed with the warped Mel filterbanks so that they match the models built from the training data. The effectiveness of these procedures is examined in an experimental study on an isolated-digit database of men, women and children, in comparison with other standard speaker normalization methods. The experimental results show recognition accuracy increases of up to 19.5% and 16.5%.
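The abstract does not give the warping formula, so the following is only a minimal sketch of the standard first-order all-pass (bilinear) frequency warping that is commonly used for vocal tract length normalization; it is not necessarily the exact mapping the authors used. The function names, the sample rate, the filter center frequencies, and the value of the warping factor `alpha` are all hypothetical; in the paper the warping factor is estimated from third-formant statistics, which is not reproduced here.

```python
import math


def blt_warp(omega: float, alpha: float) -> float:
    """Warp a normalized frequency omega (radians, 0..pi) with the
    first-order all-pass (bilinear) map. Requires |alpha| < 1.

    alpha = 0 is the identity; alpha > 0 shifts frequencies upward,
    alpha < 0 shifts them downward. The endpoints 0 and pi stay fixed.
    """
    if not -1.0 < alpha < 1.0:
        raise ValueError("alpha must satisfy |alpha| < 1")
    return omega + 2.0 * math.atan2(alpha * math.sin(omega),
                                    1.0 - alpha * math.cos(omega))


def warp_hz(freq_hz: float, alpha: float, sample_rate_hz: float) -> float:
    """Apply the bilinear warp to a frequency given in Hz."""
    nyquist = sample_rate_hz / 2.0
    omega = math.pi * freq_hz / nyquist
    return blt_warp(omega, alpha) * nyquist / math.pi


if __name__ == "__main__":
    # Hypothetical filter center frequencies (Hz) and warping factor,
    # chosen only for illustration.
    centers_hz = [300.0, 1000.0, 2500.0, 3500.0]
    alpha = 0.05
    for f in centers_hz:
        print(f, "->", round(warp_hz(f, alpha, sample_rate_hz=8000.0), 1))
```

Under this assumption, warping the filterbank center frequencies once per speaker class and then recomputing the MFCCs would stand in for the "warped Mel filterbanks" step described in the abstract.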
Keywords :
channel bank filters; speaker recognition; speech processing; transforms; Mel filterbank; bilinear transform; speaker normalization procedure; speech recognition; vocal tract length normalization; warping factors; Bandwidth; Databases; Feature extraction; Iterative methods; Matched filters; Mel frequency cepstral coefficient; Piecewise linear techniques; Speech recognition; Testing; Training data;
Conference_Title :
Control, Automation, Robotics and Vision Conference, 2004. ICARCV 2004 8th
Print_ISBN :
0-7803-8653-1
DOI :
10.1109/ICARCV.2004.1468885