DocumentCode :
700074
Title :
Using entropy as a stream reliability estimate for audio-visual speech recognition
Author :
Gurban, Mihai ; Thiran, Jean-Philippe
Author_Institution :
Signal Process. Lab. (LTS5), Ecole Polytech. Fed. de Lausanne (EPFL), Lausanne, Switzerland
fYear :
2008
fDate :
25-29 Aug. 2008
Firstpage :
1
Lastpage :
5
Abstract :
We present a method for dynamically integrating audiovisual information for speech recognition, based on the estimated reliability of the audio and visual streams. Our method uses an information theoretic measure, the entropy derived from the state probability distribution for each stream, as an estimate of reliability. The two modalities, audio and video, are weighted at each time instant according to their reliability. In this way, the weights vary dynamically and are able to adapt to any type of noise in each modality, and more importantly, to unexpected variations in the level of noise.
Keywords :
audio streaming; audio-visual systems; entropy; probability; reliability; speech recognition; audio stream; audio-visual speech recognition; entropy; estimated reliability; information theoretic measure; state probability distribution; stream reliability; visual stream; Entropy; Feature extraction; Hidden Markov models; Noise; Reliability; Speech recognition; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2008 16th European
Conference_Location :
Lausanne
ISSN :
2219-5491
Type :
conf
Filename :
7080606
Link To Document :
بازگشت