• DocumentCode
    2576765
  • Title

    Methods for improving robustness of decision tree in Mandarin speech recognition

  • Author

    Xu, Xianghua ; Zhu, Jie ; Guo, Qiang

  • Author_Institution
    Dept. of Electron. Eng., Shanghai Jiao Tong Univ., China
  • Volume
    3
  • fYear
    2004
  • fDate
    27-30 June 2004
  • Firstpage
    1975
  • Abstract
    Phonetic decision tree based state tying has been widely used in most large vocabulary continuous speech recognition (LVCSR) systems. However, in most cases, the samples of different leaf nodes are very unbalanced, which may affect the recognition performance. In This work, node merging techniques are proposed to alleviate the problem and further decrease the number of senones. On the other hand, in order to lessen the impact of rare triphones on the quality of the decision tree based state tying and improve the accuracy of every final senone, two methods of dealing with rare triphones are added to hidden Markov model (HMM) acoustic modeling before state tying. Experimental results show that these methods greatly improve the robustness of the decision tree and can achieve better performance with even fewer parameters.
  • Keywords
    decision trees; hidden Markov models; natural languages; speech recognition; HMM acoustic modeling; LVCSR; Mandarin speech recognition robustness; hidden Markov models; large vocabulary continuous speech recognition systems; leaf node sample balance; node merging techniques; phonetic decision tree based state tying; rare triphones; senones number reduction; Context modeling; Decision trees; Frequency; Hidden Markov models; Maximum likelihood estimation; Merging; Robustness; Speech recognition; Training data; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2004. ICME '04. 2004 IEEE International Conference on
  • Print_ISBN
    0-7803-8603-5
  • Type

    conf

  • DOI
    10.1109/ICME.2004.1394649
  • Filename
    1394649