• DocumentCode
    661480
  • Title

    Visualization of Mandarin articulation by using a physiological articulatory model

  • Author

    Dian Huang ; Xiyu Wu ; Jianguo Wei ; Hongcui Wang ; Chan Song ; Qingzhi Hou ; Jianwu Dang

  • Author_Institution
    Tianjin Key Lab. of Cognitive Comput. & Applic., Tianjin Univ., Tianjin, China
  • fYear
    2013
  • fDate
    Oct. 29 2013-Nov. 1 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    It is difficult for language learners to produce unfamiliar speech sounds accurately because they may not manipulate articulatory movements precisely by auditory feedback alone. Visual feedback can help identify the errors and promote the learning progress, especially in language learning and speech rehabilitation fields. In this paper, we propose a visualization method for Mandarin phoneme pronunciation using a three-dimensional (3D) articulatory physiological model driven by Chinese Electromagnetic Articulographic (EMA) data. A mapping from EMA data to physiological articulatory model was constructed using three points on the mid-sagittal plane of the tongue. To do so, we analyzed configurations of 30 Chinese phonemes based on an EMA database. At the same time, we designed nearly 150,000 muscle activation patterns and applied them to the physiological model to generate model-based articulatory movements. As the result, we developed a visualized articulation system with 2.5 dimensional and 3D views respectively. The mapping was evaluated using MRI data. It is found that the mean deviation was about 0.21cm for seven vowels.
  • Keywords
    audio databases; biomedical MRI; data visualisation; natural language processing; physiology; speech processing; Chinese EMA data; Chinese electromagnetic articulographic data; Chinese phonemes; EMA database; MRI data; Mandarin articulation visualization; Mandarin phoneme pronunciation visualization method; articulatory movements; auditory feedback; language learning fields; mid-sagittal plane; model-based articulatory movements; muscle activation patterns; speech rehabilitation fields; three-dimensional articulatory physiological model; tongue; visual feedback; visualized articulation system; Data models; Data visualization; Physiology; Solid modeling; Speech; Three-dimensional displays; Tongue;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013 Asia-Pacific
  • Conference_Location
    Kaohsiung
  • Type

    conf

  • DOI
    10.1109/APSIPA.2013.6694343
  • Filename
    6694343