• DocumentCode
    454738
  • Title

    Multi-Parameter Frequency Warping for Vtln by Gradient Search

  • Author

    Panchapagesan, Sankaran ; Alwan, Abeer

  • Author_Institution
    Dept. of Electr. Eng., California Univ., Los Angeles, CA
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    The current method for estimating frequency warping (FW) functions for vocal tract length normalization (VTLN) is by maximizing the ASR likelihood score by an exhaustive search over a grid of FW parameters. Exhaustive search is inefficient when estimating multi-parameter FWs, which have been shown to give improvements in recognition accuracy over single parameter FWs (J.W. McDonough, 2000). Here we develop a gradient search algorithm to obtain the optimal FW parameters for MFCC features, since previous work focussed on PLP cepstral features (J.W. McDonough, 2000). The novel calculation involved was that of the gradient of the Mel filterbank with respect to the FW parameters. Even for a single parameter, the gradient search method was more efficient than grid search by a factor of around 1.6 on the average for male children speakers tested on models trained from adult males. When used to estimate multi-parameter sine-log allpass transform (SLAPT, (J.W. McDonough, 2000)) FWs for VTLN, more than 50% reduction in word error rate was obtained with five parameter SLAPT compared to single-parameter piecewise linear FW
  • Keywords
    channel bank filters; gradient methods; search problems; speech processing; transforms; MFCC features; Mel filterbank; gradient search algorithm; multiparameter frequency warping; sine-log allpass transform; vocal tract length normalization; Automatic speech recognition; Cepstral analysis; Collision mitigation; Filter bank; Frequency estimation; Mel frequency cepstral coefficient; Parameter estimation; Piecewise linear techniques; Search methods; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1660237
  • Filename
    1660237