• DocumentCode
    1749698
  • Title

    Efficient mixture Gaussian synthesis for decision tree based state tying

  • Author

    Kato, Tsuneo ; Kuroiwa, Shingo ; Shimizu, Tohru ; Higuchl, N.

  • Author_Institution
    KDD R&D Labs. Inc., Saitama, Japan
  • Volume
    1
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    493
  • Abstract
    We propose an efficient mixture Gaussian synthesis method for decision tree based state tying that produces better context-dependent models in a short period of training time. This method makes it possible to handle mixture Gaussian HMMs in the decision tree based state tying algorithm, and provides a higher recognition performance compared to the conventional HMM training procedure using decision tree based state tying on single Gaussian HMMs. This method also reduces the steps of the HMM training procedure because the mixture incrementing process is not necessary. We applied this method to the training of telephone speech triphones, and evaluated its effect on Japanese phonetically balanced sentence tasks. Our method achieved a 1 to 2 point improvement in phoneme accuracy and a 67% reduction in training time
  • Keywords
    Gaussian distribution; decision theory; hidden Markov models; speech recognition; trees (mathematics); HMM training; Japanese phonetically balanced sentence; context-dependent models; decision tree based state tying; efficient mixture Gaussian synthesis; mixture Gaussian HMM; mixture Gaussian distribution; phoneme accuracy; recognition performance; telephone speech triphones; telephone speech triphones training; training time reduction; Automatic speech recognition; Context modeling; Decision trees; Hidden Markov models; Laboratories; Robustness; Speech analysis; Telephony; Training data; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7041-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2001.940875
  • Filename
    940875