DocumentCode
1749698
Title
Efficient mixture Gaussian synthesis for decision tree based state tying
Author
Kato, Tsuneo ; Kuroiwa, Shingo ; Shimizu, Tohru ; Higuchl, N.
Author_Institution
KDD R&D Labs. Inc., Saitama, Japan
Volume
1
fYear
2001
fDate
2001
Firstpage
493
Abstract
We propose an efficient mixture Gaussian synthesis method for decision tree based state tying that produces better context-dependent models in a short period of training time. This method makes it possible to handle mixture Gaussian HMMs in the decision tree based state tying algorithm, and provides a higher recognition performance compared to the conventional HMM training procedure using decision tree based state tying on single Gaussian HMMs. This method also reduces the steps of the HMM training procedure because the mixture incrementing process is not necessary. We applied this method to the training of telephone speech triphones, and evaluated its effect on Japanese phonetically balanced sentence tasks. Our method achieved a 1 to 2 point improvement in phoneme accuracy and a 67% reduction in training time
Keywords
Gaussian distribution; decision theory; hidden Markov models; speech recognition; trees (mathematics); HMM training; Japanese phonetically balanced sentence; context-dependent models; decision tree based state tying; efficient mixture Gaussian synthesis; mixture Gaussian HMM; mixture Gaussian distribution; phoneme accuracy; recognition performance; telephone speech triphones; telephone speech triphones training; training time reduction; Automatic speech recognition; Context modeling; Decision trees; Hidden Markov models; Laboratories; Robustness; Speech analysis; Telephony; Training data; Viterbi algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location
Salt Lake City, UT
ISSN
1520-6149
Print_ISBN
0-7803-7041-4
Type
conf
DOI
10.1109/ICASSP.2001.940875
Filename
940875
Link To Document