Title :
A multi-channel/multi-speaker articulatory database in Mandarin for speech visualization
Author :
Dan Zhang ; Xianqian Liu ; Nan Yan ; Lan Wang ; Yun Zhu ; Hui Chen
Author_Institution :
Shenzhen Inst. of Adv. Technol., Shenzhen, China
Abstract :
The application of articulatory database in speech production and automatic speech recognition has been practiced for many years. The goal of the research was to build an articulatory database specifying in Chinese Mandarin production and to investigate its efficacy in speech animation. Carstens EMA AG501 device were respectively used to capture acoustic data and articulatory data. Also, a Microsoft Kinect camera was applied to capture face-tracking data as a supplement. Finally, we tried several methods to extract acoustic parameters and built up a 3D talking head model to verify the efficacy of the database.
Keywords :
computer animation; face recognition; image sensors; object tracking; speech recognition; 3D talking head model; Carstens EMA AG501 device; Chinese Mandarin production; Mandarin; Microsoft Kinect camera; acoustic data; articulatory data; automatic speech recognition; face-tracking data; multichannel-multispeaker articulatory database; speech animation; speech production; speech visualization; Acoustics; Databases; Sensors; Speech; Speech recognition; Three-dimensional displays; Tongue; EMA; Kinect camera; Mandarin; articulatory database;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location :
Singapore
DOI :
10.1109/ISCSLP.2014.6936629