DocumentCode :
2849349
Title :
A Low-Complexity Dynamic Face-Voice Feature Fusion Approach to Multimodal Person Recognition
Author :
Shah, Dhaval ; Han, Kyu J. ; Narayanan, Shrikanth S.
Author_Institution :
Ming Hsieh Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA
fYear :
2009
fDate :
14-16 Dec. 2009
Firstpage :
24
Lastpage :
31
Abstract :
In this paper, we show the importance of face-voice correlation for audio-visual person recognition. We evaluate the performance of a system which uses the correlation between audio-visual features during speech against audio-only, video-only and audio-visual systems which use audio and visual features independently neglecting the interdependency of a person´s spoken utterance and the associated facial movements. Experiments performed on the Vid-TIMIT dataset show that the proposed multimodal scheme has lower error rate than all other comparison conditions and is more robust against replay attacks. The simplicity of the fusion technique also allows the use of only one classifier which greatly simplifies system design and allows for a simple real-time DSP implementation.
Keywords :
biometrics (access control); face recognition; feature extraction; gesture recognition; image classification; sensor fusion; speaker recognition; audio-visual person recognition; dynamic face-voice feature fusion; face-voice correlation; facial movement; image classification; multimodal person recognition; multimodal scheme; replay attack; speech features; spoken utterance; Audio-visual systems; Biometrics; Digital signal processing; Face recognition; Feature extraction; Information security; Iris; Robustness; Usability; Viterbi algorithm; audio-visual; biometric; multimodal; speaker;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia, 2009. ISM '09. 11th IEEE International Symposium on
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4244-5231-6
Electronic_ISBN :
978-0-7695-3890-7
Type :
conf
DOI :
10.1109/ISM.2009.78
Filename :
5365281
Link To Document :
بازگشت