DocumentCode
454738
Title
Multi-Parameter Frequency Warping for Vtln by Gradient Search
Author
Panchapagesan, Sankaran ; Alwan, Abeer
Author_Institution
Dept. of Electr. Eng., California Univ., Los Angeles, CA
Volume
1
fYear
2006
fDate
14-19 May 2006
Abstract
The current method for estimating frequency warping (FW) functions for vocal tract length normalization (VTLN) is by maximizing the ASR likelihood score by an exhaustive search over a grid of FW parameters. Exhaustive search is inefficient when estimating multi-parameter FWs, which have been shown to give improvements in recognition accuracy over single parameter FWs (J.W. McDonough, 2000). Here we develop a gradient search algorithm to obtain the optimal FW parameters for MFCC features, since previous work focussed on PLP cepstral features (J.W. McDonough, 2000). The novel calculation involved was that of the gradient of the Mel filterbank with respect to the FW parameters. Even for a single parameter, the gradient search method was more efficient than grid search by a factor of around 1.6 on the average for male children speakers tested on models trained from adult males. When used to estimate multi-parameter sine-log allpass transform (SLAPT, (J.W. McDonough, 2000)) FWs for VTLN, more than 50% reduction in word error rate was obtained with five parameter SLAPT compared to single-parameter piecewise linear FW
Keywords
channel bank filters; gradient methods; search problems; speech processing; transforms; MFCC features; Mel filterbank; gradient search algorithm; multiparameter frequency warping; sine-log allpass transform; vocal tract length normalization; Automatic speech recognition; Cepstral analysis; Collision mitigation; Filter bank; Frequency estimation; Mel frequency cepstral coefficient; Parameter estimation; Piecewise linear techniques; Search methods; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location
Toulouse
ISSN
1520-6149
Print_ISBN
1-4244-0469-X
Type
conf
DOI
10.1109/ICASSP.2006.1660237
Filename
1660237
Link To Document