Title :
Multi-feature audio-visual person recognition
Author :
Das, Amitav ; Manyam, Ohil K. ; Tapaswi, Makarand
Author_Institution :
Microsoft Res. India, Bangalore
Abstract :
We propose a high-performance low-complexity audio-visual person recognition framework suitable for on-line user authentication for various web-applications which delivers robustness against various types of imposter attacks by capturing face and speech dynamics from the video of the user. Instead of using the traditional frontal-face image, a set of compressed face profile vectors are extracted from multiple poses of the person. Similarly, multiple user-selected passwords are used to create robustness against imposter attacks. A novel FGRAM-CFD speech feature is proposed which captures the identity of the user from the speech dynamics contained in the password. The novel signal processing methods proposed here for speech and face feature-extraction led to high discriminative power of the combined audio-visual features. This allowed the classifier to remain simple, yet delivering a reasonably high performance at significantly low complexity as demonstrated by our trials on a 210-user audio-visual biometric database created for this research.
Keywords :
audio-visual systems; face recognition; feature extraction; signal processing; speech recognition; audio-visual biometric database; audio-visual features; audio-visual person recognition framework; face feature-extraction; face profile vectors; frontal-face image; imposter attacks; multi-feature audio-visual person recognition; multiple user-selected passwords; on-line user authentication; signal processing; speech feature-extraction; Authentication; Biomedical signal processing; Biometrics; Face recognition; Image coding; Robustness; Spatial databases; Speech processing; Speech recognition; Video compression;
Conference_Titel :
Machine Learning for Signal Processing, 2008. MLSP 2008. IEEE Workshop on
Conference_Location :
Cancun
Print_ISBN :
978-1-4244-2375-0
Electronic_ISBN :
1551-2541
DOI :
10.1109/MLSP.2008.4685484