Title :
Efficient mixture Gaussian synthesis for decision tree based state tying
Author :
Kato, Tsuneo ; Kuroiwa, Shingo ; Shimizu, Tohru ; Higuchl, N.
Author_Institution :
KDD R&D Labs. Inc., Saitama, Japan
Abstract :
We propose an efficient mixture Gaussian synthesis method for decision tree based state tying that produces better context-dependent models in a short period of training time. This method makes it possible to handle mixture Gaussian HMMs in the decision tree based state tying algorithm, and provides a higher recognition performance compared to the conventional HMM training procedure using decision tree based state tying on single Gaussian HMMs. This method also reduces the steps of the HMM training procedure because the mixture incrementing process is not necessary. We applied this method to the training of telephone speech triphones, and evaluated its effect on Japanese phonetically balanced sentence tasks. Our method achieved a 1 to 2 point improvement in phoneme accuracy and a 67% reduction in training time
Keywords :
Gaussian distribution; decision theory; hidden Markov models; speech recognition; trees (mathematics); HMM training; Japanese phonetically balanced sentence; context-dependent models; decision tree based state tying; efficient mixture Gaussian synthesis; mixture Gaussian HMM; mixture Gaussian distribution; phoneme accuracy; recognition performance; telephone speech triphones; telephone speech triphones training; training time reduction; Automatic speech recognition; Context modeling; Decision trees; Hidden Markov models; Laboratories; Robustness; Speech analysis; Telephony; Training data; Viterbi algorithm;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.940875