DocumentCode
2228351
Title
Multimodal fusion by adaptive compensation for feature uncertainty with application to audiovisual speech recognition
Author
Katsamanis, Athanassios ; Papandreou, George ; Pitsikalis, Vassilis ; Maragos, Petros
Author_Institution
Sch. of Electr. & Comput. Eng., Nat. Tech. Univ. of Athens, Athens, Greece
fYear
2006
fDate
4-8 Sept. 2006
Firstpage
1
Lastpage
5
Abstract
In pattern recognition one usually relies on measuring a set of informative features to perform tasks such as classification. While the accuracy of feature measurements heavily depends on changing environmental conditions, studying the consequences of this fact has received relatively little attention to date. In this work we explicitly take into account uncertainty in feature measurements and we show in a rigorous probabilistic framework how the models used for classification should be adjusted to compensate for this effect. Our approach proves to be particularly fruitful in multimodal fusion scenarios, such as audio-visual speech recognition, where multiple streams of complementary features are integrated. For such applications, provided that an estimate of the measurement noise uncertainty for each feature stream is available, we show that the proposed framework leads to highly adaptive multimodal fusion rules which are widely applicable and easy to implement. We further show that previous multimodal fusion methods relying on stream weights fall under our scheme if certain assumptions hold; this provides novel insights into their applicability for various tasks and suggests new practical ways for estimating the stream weights adaptively. Preliminary experimental results in audio-visual speech recognition demonstrate the potential of our approach.
Keywords
speech recognition; adaptive compensation; audio-visual speech recognition; feature uncertainty; measurement noise; multimodal fusion; pattern recognition; Face; Noise; Shape; Speech; Speech recognition; Uncertainty; Visualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2006 14th European
Conference_Location
Florence
ISSN
2219-5491
Type
conf
Filename
7071769
Link To Document