An improved HMM/VQ training procedure for speaker-independent isolated word recognition

Author

Zhang, Yaxin ; Alder, Mike

Author_Institution

Dept. of Electr. & Electron. Eng., Western Australia Univ., Nedlands, WA, Australia

fYear

1994

fDate

13-16 Apr 1994

Firstpage

722

Abstract

This paper describe an improved training procedure in a HMM/VQ speech recognition system for speaker-independent speech recognition. The phoneme based Gaussian mixture models (GMM) were generated in the first step modeling using the Expectation-Maximization (EM) algorithm. These Gaussians more accurately describe the distribution characteristic of the phonemes in the speech signal space. Therefore better first step modeling is achieved and the performance of the whole recognition system is improved. The new method was used in a speaker-independent isolated digits and phoneme recognition tasks. Two English databases were used for the training and testing. Significant improvements have been achieved in comparison with the conventional HMM/VQ system

Keywords

hidden Markov models; speech recognition; stochastic processes; vector quantisation; English databases; HMM/VQ training procedure; distribution characteristic; expectation-maximization algorithm; phoneme based Gaussian mixture models; speaker-independent isolated word recognition; speech signal space; Books; Clustering algorithms; Hidden Markov models; Image coding; Image recognition; Signal generators; Signal processing; Signal processing algorithms; Speech recognition; Vector quantization;

fLanguage

English

Publisher

ieee

Conference_Titel

Speech, Image Processing and Neural Networks, 1994. Proceedings, ISSIPNN '94., 1994 International Symposium on

Print_ISBN

0-7803-1865-X

Type

conf

DOI

10.1109/SIPNN.1994.344810

Filename

344810