Title :
A bi-lingual Thai-English TTS system on Android mobile devices
Author :
Saychum, S. ; Thangthai, A. ; Janjoi, P. ; Thatphithakkul, N. ; Wutiwiwatchai, C. ; Lamsrichan, P. ; Kobayashi, T.
Author_Institution :
TAIST Tokyo Tech, Kasetsart Univ., Bangkok, Thailand
Abstract :
This paper presents a bi-lingual Thai-English text-to-speech synthesis (TTS) system on Android mobile devices. The system deploys a Thai text processor and a well-known open-source English text processor, which can analyzes English text at high intelligibility. With hidden Markov model (HMM) based speech unit and audio streaming optimization, it can synthesize highly smoothed sounds at a fast response. This paper reveals the optimization of important components. Conditional random fields (CRF) successfully used in Thai word segmentation and a syllable-pattern based statistical modeling for Thai grapheme-to-phoneme conversion are assessed. Several types of speech parameters are compared for best performance. The optimized system produced as high as 3.68 mean opinion score (MOS) with response less than 2 seconds on both high and low specification devices.
Keywords :
audio streaming; hidden Markov models; mobile computing; natural language processing; operating systems (computers); public domain software; random processes; speech synthesis; word processing; Android mobile devices; CRF; HMM-based speech unit; Thai grapheme-to-phoneme conversion; Thai text processor; Thai word segmentation; audio streaming optimization; bilingual Thai-English TTS system; bilingual Thai-English text-to-speech synthesis system; conditional random fields; hidden Markov model; highly smoothed sound synthesis; mean opinion score; open source English text processor; speech parameters; syllable-pattern based statistical modeling; Engines; Hidden Markov models; Mobile handsets; Speech; Speech synthesis; Text processing; Time factors;
Conference_Titel :
Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON), 2012 9th International Conference on
Conference_Location :
Phetchaburi
Print_ISBN :
978-1-4673-2026-9
DOI :
10.1109/ECTICon.2012.6254283