DocumentCode :
454623
Title :
A Dempster-Shafer Based Fusion Approach for Audio-Visual Speech Recognition with Application to Large Vocabulary French Speech
Author :
Foucher, S. ; Laliberté, F. ; Boulianne, G. ; Gagnon, L.
Author_Institution :
Dept. of R&D, CRIM
Volume :
1
fYear :
2006
fDate :
14-19 May 2006
Abstract :
This work explores a new way of fusing audio and visual information for audio-visual automatic speech recognition in the context of a large vocabulary application. Mouth shape information is extracted off-line and integrated into a speech recognition system using a phoneme-based Dempster-Shafer fusion approach. The fusion methodology assumes that the audio information about the phonemes is a precise Bayesian source while the visual information is an imprecise evidential source. This ensures that the visual information does not degrade significantly the audio information in situation where the audio performs well in controlled noiseless environment. Bayesian and simple consonance belief structures are explored and compared, along with standard stack-based fusion
Keywords :
Bayes methods; audio signal processing; image recognition; inference mechanisms; sensor fusion; speech processing; speech recognition; Bayesian source; audio-visual automatic speech recognition; consonance belief structures; large vocabulary French speech; mouth shape information; phoneme-based Dempster-Shafer fusion approach; stack-based fusion; Speech recognition; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
ISSN :
1520-6149
Print_ISBN :
1-4244-0469-X
Type :
conf
DOI :
10.1109/ICASSP.2006.1660091
Filename :
1660091
Link To Document :
بازگشت