DocumentCode :
3413710
Title :
Audiovisual-to-articulatory speech inversion using Active Appearance Models for the face and Hidden Markov Models for the dynamics
Author :
Katsamanis, Athanassios ; Papandreou, George ; Maragos, Petros
Author_Institution :
Sch. of E.C.E., Nat. Tech. Univ. of Athens, Athens
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
2237
Lastpage :
2240
Abstract :
We are interested in recovering aspects of the vocal tract's geometry and dynamics from auditory and visual speech cues. We approach the problem in a statistical framework based on Hidden Markov Models and demonstrate effective estimation of the trajectories followed by certain points of interest in the speech production system. Alternative fusion schemes are investigated to account for asynchrony between the modalities and to allow independent modeling of the dynamics of the involved streams. Visual cues are extracted from the speaker's face by means of active appearance modeling. We report experiments on the QSMT database, which contains audio, video, and electromagnetic articulography data recorded in parallel. The results show that exploiting both audio and visual modalities in a multistream HMM-based scheme clearly improves performance relative to either audio-only or visual-only estimation.
Keywords :
face recognition; hidden Markov models; speech processing; active appearance model; audiovisual-to-articulatory speech inversion; face model; hidden Markov model; vocal tract dynamics; vocal tract geometry; Acoustics; Active appearance model; Audio databases; Frequency estimation; Hidden Markov models; Image databases; Solid modeling; Speech coding; Speech processing; Tongue; Hidden Markov Models; articulatory; audiovisual; fusion; speech inversion
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518090
Filename :
4518090