• DocumentCode
    703420
  • Title

    Speaker normalization for automatic speech recognition — An on-line approach

  • Author

    Dologlou, Ioannis ; Claes, Tom ; ten Bosch, Louis ; Van Compernolle, Dirk ; Van hamme, Hugo

  • Author_Institution
    ESAT - PSI, Katholieke Univ. Leuven, Leuven, Belgium
  • fYear
    1998
  • fDate
    8-11 Sept. 1998
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    We propose a method to transform the on line speech signal so as to comply with the specifications of an HMM-based automatic speech recognizer. The spectrum of the input signal undergoes a vocal tract length (VTL) normalization based on differences of the average third formant F3. The high frequency gap which is generated after scaling is estimated by means of an extrapolation scheme. Mel scale cepstral coefficients (MFCC) are used along with delta and delta2-cepstra as well as delta and delta2 energy. The method has been tested on the TI digits database which contains adult and kids speech providing substantial gains with respect to non normalized speech.
  • Keywords
    cepstral analysis; extrapolation; hidden Markov models; speaker recognition; HMM-based automatic speech recognizer; MFCC; Mel scale cepstral coefficients; TI digits database; VTL normalization; automatic speech recognition; delta; delta2 energy; delta2-cepstra; extrapolation scheme; frequency gap estimation; signal spectrum; speaker normalization; vocal tract length; Extrapolation; Frequency estimation; Hidden Markov models; Interpolation; Mel frequency cepstral coefficient; Speech; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference (EUSIPCO 1998), 9th European
  • Conference_Location
    Rhodes
  • Print_ISBN
    978-960-7620-06-4
  • Type

    conf

  • Filename
    7089891