DocumentCode
3485532
Title
Accent level adjustment in bilingual Thai-English text-to-speech synthesis
Author
Wutiwiwatchai, Chai ; Thangthai, Ausdang ; Chotimongkol, Ananlada ; Hansakunbuntheung, Chatchawarn ; Thatphithakkul, Nattanun
Author_Institution
Human Language Technol. Lab., Nat. Electron. & Comput. Technol. Center (NECTEC), Pathumthani, Thailand
fYear
2011
fDate
11-15 Dec. 2011
Firstpage
295
Lastpage
299
Abstract
This paper introduces an accent level adjustment mechanism for Thai-English text-to-speech (TTS) synthesis. English words, which appear often in modern Thai writing, can be synthesized either by a Thai TTS system using the corresponding Thai phones or by a separate English TTS system using English phones. Because many native Thai listeners may prefer neither of these extreme accent styles, a mechanism that lets the accent level be selected is proposed. In HMM-based TTS, the accent level is adjusted by interpolating between HMMs of purely Thai and purely English sounds. Solutions for cross-language phone alignment and HMM state mapping are presented. The method is evaluated in a listening test on speech synthesized at varied accent levels. Experimental results show that the proposed method is acceptable to the majority of listeners.
Keywords
hidden Markov models; interpolation; natural language processing; speech synthesis; English TTS; English phones; HMM interpolation; HMM state mapping; Thai phones; Thai writing; accent level adjustment; bilingual Thai-English text-to-speech synthesis; cross-language phone alignment; Nickel; Testing
fLanguage
English
Publisher
IEEE
Conference_Titel
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Conference_Location
Waikoloa, HI
Print_ISBN
978-1-4673-0365-1
Electronic_ISBN
978-1-4673-0366-8
Type
conf
DOI
10.1109/ASRU.2011.6163947
Filename
6163947