DocumentCode
454623
Title
A Dempster-Shafer Based Fusion Approach for Audio-Visual Speech Recognition with Application to Large Vocabulary French Speech
Author
Foucher, S. ; Laliberté, F. ; Boulianne, G. ; Gagnon, L.
Author_Institution
Dept. of R&D, CRIM
Volume
1
fYear
2006
fDate
14-19 May 2006
Abstract
This work explores a new way of fusing audio and visual information for audio-visual automatic speech recognition in the context of a large vocabulary application. Mouth shape information is extracted off-line and integrated into a speech recognition system using a phoneme-based Dempster-Shafer fusion approach. The fusion methodology assumes that the audio information about the phonemes is a precise Bayesian source while the visual information is an imprecise evidential source. This ensures that the visual information does not degrade significantly the audio information in situation where the audio performs well in controlled noiseless environment. Bayesian and simple consonance belief structures are explored and compared, along with standard stack-based fusion
Keywords
Bayes methods; audio signal processing; image recognition; inference mechanisms; sensor fusion; speech processing; speech recognition; Bayesian source; audio-visual automatic speech recognition; consonance belief structures; large vocabulary French speech; mouth shape information; phoneme-based Dempster-Shafer fusion approach; stack-based fusion; Speech recognition; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location
Toulouse
ISSN
1520-6149
Print_ISBN
1-4244-0469-X
Type
conf
DOI
10.1109/ICASSP.2006.1660091
Filename
1660091
Link To Document