DocumentCode
2286830
Title
An improved HMM/VQ training procedure for speaker-independent isolated word recognition
Author
Zhang, Yaxin ; Alder, Mike
Author_Institution
Dept. of Electr. & Electron. Eng., Western Australia Univ., Nedlands, WA, Australia
fYear
1994
fDate
13-16 Apr 1994
Firstpage
722
Abstract
This paper describe an improved training procedure in a HMM/VQ speech recognition system for speaker-independent speech recognition. The phoneme based Gaussian mixture models (GMM) were generated in the first step modeling using the Expectation-Maximization (EM) algorithm. These Gaussians more accurately describe the distribution characteristic of the phonemes in the speech signal space. Therefore better first step modeling is achieved and the performance of the whole recognition system is improved. The new method was used in a speaker-independent isolated digits and phoneme recognition tasks. Two English databases were used for the training and testing. Significant improvements have been achieved in comparison with the conventional HMM/VQ system
Keywords
hidden Markov models; speech recognition; stochastic processes; vector quantisation; English databases; HMM/VQ training procedure; distribution characteristic; expectation-maximization algorithm; phoneme based Gaussian mixture models; speaker-independent isolated word recognition; speech signal space; Books; Clustering algorithms; Hidden Markov models; Image coding; Image recognition; Signal generators; Signal processing; Signal processing algorithms; Speech recognition; Vector quantization;
fLanguage
English
Publisher
ieee
Conference_Titel
Speech, Image Processing and Neural Networks, 1994. Proceedings, ISSIPNN '94., 1994 International Symposium on
Print_ISBN
0-7803-1865-X
Type
conf
DOI
10.1109/SIPNN.1994.344810
Filename
344810
Link To Document