• DocumentCode
    134237
  • Title

    A multi-channel/multi-speaker articulatory database in Mandarin for speech visualization

  • Author

    Dan Zhang ; Xianqian Liu ; Nan Yan ; Lan Wang ; Yun Zhu ; Hui Chen

  • Author_Institution
    Shenzhen Inst. of Adv. Technol., Shenzhen, China
  • fYear
    2014
  • fDate
    12-14 Sept. 2014
  • Firstpage
    299
  • Lastpage
    303
  • Abstract
    The application of articulatory database in speech production and automatic speech recognition has been practiced for many years. The goal of the research was to build an articulatory database specifying in Chinese Mandarin production and to investigate its efficacy in speech animation. Carstens EMA AG501 device were respectively used to capture acoustic data and articulatory data. Also, a Microsoft Kinect camera was applied to capture face-tracking data as a supplement. Finally, we tried several methods to extract acoustic parameters and built up a 3D talking head model to verify the efficacy of the database.
  • Keywords
    computer animation; face recognition; image sensors; object tracking; speech recognition; 3D talking head model; Carstens EMA AG501 device; Chinese Mandarin production; Mandarin; Microsoft Kinect camera; acoustic data; articulatory data; automatic speech recognition; face-tracking data; multichannel-multispeaker articulatory database; speech animation; speech production; speech visualization; Acoustics; Databases; Sensors; Speech; Speech recognition; Three-dimensional displays; Tongue; EMA; Kinect camera; Mandarin; articulatory database;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
  • Conference_Location
    Singapore
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2014.6936629
  • Filename
    6936629