• DocumentCode
    3109704
  • Title

    Thai ASR development for network-based speech translation

  • Author

    Wutiwiwatchai, Chai ; Thangthai, K. ; Sertsi, P.

  • Author_Institution
    Nat. Electron. & Comput. Technol. Center (NECTEC), Pathumthani, Thailand
  • fYear
    2012
  • fDate
    9-12 Dec. 2012
  • Firstpage
    92
  • Lastpage
    96
  • Abstract
    A network-based multilingual speech translation service under the Universal Speech Translation Advanced Research (U-STAR) consortium requires a well-tuned Thai automatic speech recognition (ASR) service. This paper summarizes the development of that service using both Thai read-speech and telephone-speech (LOTUS-CELL 2.0) corpora. Tuning is performed over different sets of acoustic units and training data. An evaluation shows that the recognition accuracy of ASR operating over data channels can be improved by using the LOTUS-CELL 2.0 corpus, even though the corpus was constructed via voice channels. Named-entity (NE) words, often found in the working domain, are an evident problem and an urgent topic for future work.
  • Keywords
    language translation; natural language processing; speech recognition; ASR; LOTUS-CELL 2.0; NE; Thai ASR development; Thai automatic speech recognition service; Thai read-speech; U-STAR; named-entity words; network-based multilingual speech translation service; telephone speech; universal speech translation advanced research; Acoustics; Adaptation models; Data models; Hidden Markov models; Mobile handsets; Speech; Speech recognition; Thai ASR; mobile speech; speech translation
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Speech Database and Assessments (Oriental COCOSDA), 2012 International Conference on
  • Conference_Location
    Macau
  • Print_ISBN
    978-1-4673-2811-1
  • Electronic_ISBN
    978-1-4673-2812-8
  • Type

    conf

  • DOI
    10.1109/ICSDA.2012.6422477
  • Filename
    6422477