• DocumentCode
    3570566
  • Title

    Estimating vocal tract shapes of Thai vowels from contextual vowel variation

  • Author

    Prom-on, Santitham; Birkholz, Peter; Xu, Yi

  • Author_Institution
    Dept. of Comput. Eng., King Mongkut's Univ. of Technol., Thonburi, Thailand
  • fYear
    2014
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This paper presents a computational estimation of vocal tract shape parameters as articulatory targets of Thai vowels in an articulatory synthesizer, by means of analysis-by-synthesis with acoustic data as input. A speech corpus designed to capture the contextual variants of nine Thai long vowels, consisting of 81 disyllabic utterances, was recorded by a native Thai speaker. For each utterance, two targets, one for each syllable, were estimated by optimizing the target parameters to minimize the MFCC error between original and synthesized speech. An analysis-by-synthesis approach was used to iteratively optimize the shape parameters. The estimated targets of each vowel type were then averaged, resulting in nine articulatory targets, each corresponding to a vowel. The optimized targets were then used to synthesize Thai vowels both in monosyllables and in disyllabic sequences. The results indicate that the estimated targets effectively represent the underlying articulatory goals of Thai vowels.
  • Keywords
    estimation theory; natural language processing; speech synthesis; MFCC error; Thai vowels; acoustic data; analysis-by-synthesis; articulatory synthesizer; contextual vowel variation; disyllabic sequences; vocal tract shapes estimation; Approximation methods; Hidden Markov models; Optimization; Shape; Speech; Synthesizers; Articulatory synthesis; Thai vowel; target approximation; vocal tract shape
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA)
  • Type

    conf

  • DOI
    10.1109/ICSDA.2014.7051442
  • Filename
    7051442