Title :
Variable parameter Gaussian mixture hidden Markov modeling for speech recognition
Author :
Cui, Xiaodong ; Gong, Yifan
Author_Institution :
Dept. of Electr. Eng., Univ. of California, Los Angeles, CA, USA
Abstract :
To improve recognition, speech signals corrupted by a variety of noises can be used in speech model training. Published hidden Markov modeling of speech uses multiple Gaussian distributions to cover the spread of the speech distribution caused by the noises, which distracts the modeling of speech event itself and and possibly sacrifices the performance on clean speech. We extend GMHMM by allowing state emission parameters to change as function of an environment-dependent continuous variable. At the recognition time, a set of HMMs specific to the given the environment is instantiated and used for recognition. Variable parameter (VP) HMM with parameters modeled as a polynomial function of the environment variable is developed. Parameter estimation based on EM-algorithm is given. With the same number of mixtures, VPHMM reduces WER by 40% compared to conventional multi-condition training.
Keywords :
hidden Markov models; optimisation; parameter estimation; polynomials; speech recognition; EM-algorithm; GMHMM; VPHMM; WER reduction; environment-dependent continuous variable; hidden Markov modeling; multicondition training; multiple Gaussian distributions; parameter estimation; polynomial function; recognition time; speech distribution; speech model training; speech recognition; speech signals; variable parameter Gaussian mixture; variable parameter HMM; word error rate reduction; Gaussian distribution; Gaussian noise; Hidden Markov models; Instruments; Noise level; Phase noise; Polynomials; Speech enhancement; Speech recognition; Working environment noise;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1198704