DocumentCode
134237
Title
A multi-channel/multi-speaker articulatory database in Mandarin for speech visualization
Author
Dan Zhang ; Xianqian Liu ; Nan Yan ; Lan Wang ; Yun Zhu ; Hui Chen
Author_Institution
Shenzhen Inst. of Adv. Technol., Shenzhen, China
fYear
2014
fDate
12-14 Sept. 2014
Firstpage
299
Lastpage
303
Abstract
Articulatory databases have been applied in speech production research and automatic speech recognition for many years. The goal of this research was to build an articulatory database specific to Mandarin Chinese production and to investigate its efficacy in speech animation. A Carstens EMA AG501 device was used to capture the acoustic and articulatory data. In addition, a Microsoft Kinect camera was used to capture face-tracking data as a supplement. Finally, we tried several methods to extract acoustic parameters and built a 3D talking head model to verify the efficacy of the database.
Keywords
computer animation; face recognition; image sensors; object tracking; speech recognition; 3D talking head model; Carstens EMA AG501 device; Chinese Mandarin production; Mandarin; Microsoft Kinect camera; acoustic data; articulatory data; automatic speech recognition; face-tracking data; multichannel-multispeaker articulatory database; speech animation; speech production; speech visualization; Acoustics; Databases; Sensors; Speech; Speech recognition; Three-dimensional displays; Tongue; EMA; Kinect camera; Mandarin; articulatory database
fLanguage
English
Publisher
IEEE
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location
Singapore
Type
conf
DOI
10.1109/ISCSLP.2014.6936629
Filename
6936629