• DocumentCode
    3485532
  • Title

    Accent level adjustment in bilingual Thai-English text-to-speech synthesis

  • Author

    Wutiwiwatchai, Chai ; Thangthai, Ausdang ; Chotimongkol, Ananlada ; Hansakunbuntheung, Chatchawarn ; Thatphithakkul, Nattanun

  • Author_Institution
    Human Language Technol. Lab., Nat. Electron. & Comput. Technol. Center (NECTEC), Pathumthani, Thailand
  • fYear
    2011
  • fDate
    11-15 Dec. 2011
  • Firstpage
    295
  • Lastpage
    299
  • Abstract
    This paper introduces an accent level adjustment mechanism for Thai-English text-to-speech synthesis (TTS). English words, which often appear in modern Thai writing, can be synthesized either by a Thai TTS system using the corresponding Thai phones or by a separate English TTS system using English phones. As many native Thai listeners may prefer neither of these extreme accent styles, a mechanism that allows selection of a preferred accent level is proposed. In HMM-based TTS, the accent level is adjusted by interpolating between HMMs of purely Thai and purely English sounds. Solutions for cross-language phone alignment and HMM state mapping are addressed. The method is evaluated by a listening test on speech synthesized at varied accent levels. Experimental results show that the proposed method is acceptable to the majority of human listeners.
  • Keywords
    hidden Markov models; interpolation; natural language processing; speech synthesis; English TTS; English phones; HMM interpolation; HMM state mapping; Thai phones; Thai writing; accent level adjustment; bilingual Thai-English text-to-speech synthesis; cross-language phone alignment; Nickel; Testing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
  • Conference_Location
    Waikoloa, HI
  • Print_ISBN
    978-1-4673-0365-1
  • Electronic_ISBN
    978-1-4673-0366-8
  • Type

    conf

  • DOI
    10.1109/ASRU.2011.6163947
  • Filename
    6163947