Title :
Speaker normalization for automatic speech recognition — An on-line approach
Author :
Dologlou, Ioannis ; Claes, Tom ; ten Bosch, Louis ; Van Compernolle, Dirk ; Van hamme, Hugo
Author_Institution :
ESAT - PSI, Katholieke Univ. Leuven, Leuven, Belgium
Abstract :
We propose a method that transforms the on-line speech signal to match the conditions expected by an HMM-based automatic speech recognizer. The spectrum of the input signal undergoes vocal tract length (VTL) normalization based on differences in the average third formant, F3. The high-frequency gap that the scaling leaves is estimated by means of an extrapolation scheme. Mel-scale cepstral coefficients (MFCC) are used along with delta and delta2 cepstra as well as delta and delta2 energy. The method has been tested on the TI digits database, which contains speech from adults and children, and provides substantial gains over non-normalized speech.
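The abstract does not give the warping function or the extrapolation scheme, so the following is only a minimal sketch of the general idea under stated assumptions: a linear frequency scaling by the hypothetical factor `alpha = f3_reference / f3_speaker`, and a straight-line fit to the last few valid log-magnitude bins to fill the high-frequency gap. The function name `vtl_normalize`, its parameters, and the `fit_bins` setting are illustrative and not taken from the paper.

```python
import numpy as np

def vtl_normalize(spectrum, sample_rate, f3_speaker, f3_reference, fit_bins=8):
    """Linearly warp a magnitude spectrum so the speaker's F3 lands on the reference F3.

    Assumption: a single linear scale factor; the paper's actual warping may differ.
    """
    spectrum = np.asarray(spectrum, dtype=float)
    n_bins = spectrum.size
    freqs = np.linspace(0.0, sample_rate / 2.0, n_bins)

    # Hypothetical scale factor: below 1 when the speaker's F3 is above the
    # reference (shorter vocal tract), which compresses the spectrum.
    alpha = f3_reference / f3_speaker

    # Output bin at frequency f takes its value from input frequency f / alpha.
    source_freqs = freqs / alpha
    valid = source_freqs <= freqs[-1]

    log_spec = np.log(spectrum + 1e-12)
    warped = np.empty(n_bins)
    warped[valid] = np.interp(source_freqs[valid], freqs, log_spec)

    if not valid.all():
        # High-frequency gap: no input data maps here. Assumed scheme: extend
        # a straight line fitted to the last valid log-magnitude bins.
        idx = np.flatnonzero(valid)[-fit_bins:]
        slope, intercept = np.polyfit(idx, warped[idx], 1)
        gap = np.flatnonzero(~valid)
        warped[gap] = slope * gap + intercept

    return np.exp(warped)
```

With `f3_speaker` above `f3_reference`, as might be expected for a child's voice relative to an adult reference, the warped spectrum ends below the Nyquist frequency and the extrapolation branch fills the remaining bins. When the two F3 values are equal, the spectrum is returned essentially unchanged.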
Keywords :
cepstral analysis; extrapolation; hidden Markov models; speaker recognition; HMM-based automatic speech recognizer; MFCC; Mel scale cepstral coefficients; TI digits database; VTL normalization; automatic speech recognition; delta; delta2 energy; delta2-cepstra; extrapolation scheme; frequency gap estimation; signal spectrum; speaker normalization; vocal tract length; Extrapolation; Frequency estimation; Hidden Markov models; Interpolation; Mel frequency cepstral coefficient; Speech; Speech recognition;
Conference_Title :
Signal Processing Conference (EUSIPCO 1998), 9th European
Conference_Location :
Rhodes
Print_ISBN :
978-960-7620-06-4