• DocumentCode
    3549601
  • Title

    A bilinear transform approach for vocal tract length normalization

  • Author

    Wang, Xu ; Bing-Xi, Wang ; Qi, Ding

  • Author_Institution
    Inf. Eng. Univ., Henan, China
  • Volume
    1
  • fYear
    2004
  • fDate
    6-9 Dec. 2004
  • Firstpage
    547
  • Abstract
    We have developed and evaluated a set of speaker normalization procedures derived by bilinear transform (BLT) to compensate for variations in vocal tract lengths of different classes of speakers. The warping factors are estimated using the average third formants and their bandwidth, leaving out the exhaustive search. The MFCC of the testing data are transformed by the warped Mel filterbanks to match the models of the training data. The effectiveness of this set of speaker normalization procedures is examined in an experimental study performed using an isolated digit database of man, woman and children comparing to other standard speaker normalization method. The results of experiments demonstrate their capacity to achieve recognition accuracy increase of 19.5% and 16.5% at the best.
  • Keywords
    channel bank filters; speaker recognition; speech processing; transforms; Mel filterbank; bilinear transform; speaker normalization procedure; speech recognition; vocal tract length normalization; warping factors; Bandwidth; Databases; Feature extraction; Iterative methods; Matched filters; Mel frequency cepstral coefficient; Piecewise linear techniques; Speech recognition; Testing; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control, Automation, Robotics and Vision Conference, 2004. ICARCV 2004 8th
  • Print_ISBN
    0-7803-8653-1
  • Type

    conf

  • DOI
    10.1109/ICARCV.2004.1468885
  • Filename
    1468885