• DocumentCode
    2541917
  • Title

    Improving of Acoustic Model for the Mongolian Speech Recognition System

  • Author

    Bao, Feilong ; Gao, Guanglai

  • Author_Institution
    Coll. of Comput. Sci., Inner Mongolia Univ., Hohhot, China
  • fYear
    2009
  • fDate
    4-6 Nov. 2009
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    The research of Mongolian speech recognition technology start comparatively late and it is still in its primary stage. In this paper, we optimized the basic resources of Mongolian speech recognition system, and we also improved the acoustic model of Mongolian speech recognition system, and this is most important. In this paper, we realized continuous HMM Gaussian mixture model and multiple data stream SCHMM model on the basis of context dependent phonetic model and decision tree method. And we compared the two models in performances. Finally, a large quantity of experiments have been taken to the testing set with HTK as an experimental platform by applying trigram language model and acoustic model which is composed of context dependent phonetic model, decision tree method and multiple data stream SCHMM model. We found system performance has been optimized, and system recognition accuracy rates of word and sentence have been greatly improved.
  • Keywords
    Gaussian processes; acoustic signal processing; decision trees; hidden Markov models; speech processing; speech recognition; Mongolian speech recognition system; acoustic model; context dependent phonetic model; continuous HMM Gaussian mixture model; decision tree method; multiple data stream SCHMM model; trigram language model; Acoustic testing; Computer science; Context modeling; Decision trees; Educational institutions; Electronic mail; Hidden Markov models; Phase change materials; Speech recognition; System performance;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2009. CCPR 2009. Chinese Conference on
  • Conference_Location
    Nanjing
  • Print_ISBN
    978-1-4244-4199-0
  • Type

    conf

  • DOI
    10.1109/CCPR.2009.5344043
  • Filename
    5344043