DocumentCode
3109704
Title
Thai ASR development for network-based speech translation
Author
Wutiwiwatchai, Chai ; Thangthai, K. ; Sertsi, P.
Author_Institution
Nat. Electron. & Comput. Technol. Center (NECTEC), Pathumthani, Thailand
fYear
2012
fDate
9-12 Dec. 2012
Firstpage
92
Lastpage
96
Abstract
A network-based multilingual speech translation service under the Universal Speech Translation Advanced Research (U-STAR) consortium requires a well-tuned Thai automatic speech recognition (ASR) service. This paper summarizes the development of the service by utilizing both Thai read-speech and telephone speech (LOTUS-CELL 2.0) corpora. Tuning is performed regarding different sets of acoustic unit and training data. An evaluation shows that the recognition accuracy of ASR working over data channels can be improved by using the LOTUS-CELL 2.0 corpus although the corpus was constructed via voice channels. The problem of Named-entity (NE) words often found in the working domain is obvious and leads to an urgent future work.
Keywords
language translation; natural language processing; speech recognition; ASR; LOTUS-CELL 2.0; NE; Thai ASR development; Thai automatic speech recognition service; Thai read-speech; U-STAR; named-entity words; network-based multilingual speech translation service; telephone speech; universal speech translation advanced research; Acoustics; Adaptation models; Data models; Hidden Markov models; Mobile handsets; Speech; Speech recognition; Thai ASR; mobile speech; speech translation;
fLanguage
English
Publisher
ieee
Conference_Titel
Speech Database and Assessments (Oriental COCOSDA), 2012 International Conference on
Conference_Location
Macau
Print_ISBN
978-1-4673-2811-1
Electronic_ISBN
978-1-4673-2812-8
Type
conf
DOI
10.1109/ICSDA.2012.6422477
Filename
6422477
Link To Document