• DocumentCode
    2286830
  • Title

    An improved HMM/VQ training procedure for speaker-independent isolated word recognition

  • Author

    Zhang, Yaxin ; Alder, Mike

  • Author_Institution
    Dept. of Electr. & Electron. Eng., Western Australia Univ., Nedlands, WA, Australia
  • fYear
    1994
  • fDate
    13-16 Apr 1994
  • Firstpage
    722
  • Abstract
    This paper describe an improved training procedure in a HMM/VQ speech recognition system for speaker-independent speech recognition. The phoneme based Gaussian mixture models (GMM) were generated in the first step modeling using the Expectation-Maximization (EM) algorithm. These Gaussians more accurately describe the distribution characteristic of the phonemes in the speech signal space. Therefore better first step modeling is achieved and the performance of the whole recognition system is improved. The new method was used in a speaker-independent isolated digits and phoneme recognition tasks. Two English databases were used for the training and testing. Significant improvements have been achieved in comparison with the conventional HMM/VQ system
  • Keywords
    hidden Markov models; speech recognition; stochastic processes; vector quantisation; English databases; HMM/VQ training procedure; distribution characteristic; expectation-maximization algorithm; phoneme based Gaussian mixture models; speaker-independent isolated word recognition; speech signal space; Books; Clustering algorithms; Hidden Markov models; Image coding; Image recognition; Signal generators; Signal processing; Signal processing algorithms; Speech recognition; Vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Speech, Image Processing and Neural Networks, 1994. Proceedings, ISSIPNN '94., 1994 International Symposium on
  • Print_ISBN
    0-7803-1865-X
  • Type

    conf

  • DOI
    10.1109/SIPNN.1994.344810
  • Filename
    344810