DocumentCode :
2701233
Title :
An Evaluation of Audio-Visual Person Recognition on the XM2VTS Corpus using the Lausanne Protocols
Author :
Brady, K. ; Brandstein, M. ; Quatieri, T. ; Dunn, B.
Author_Institution :
MIT Lincoln Lab., Lexington, MA, USA
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
A multimodal person recognition architecture has been developed to improve overall recognition performance and to address channel-specific performance shortfalls. The architecture fuses a face recognition system with the MIT/LL GMM/UBM speaker recognition architecture, exploiting the complementary and redundant nature of the face and speech modalities. It has been evaluated on the XM2VTS corpus using the Lausanne open-set verification protocols and demonstrates excellent recognition performance, with strong gains over the individual modalities.
Keywords :
audio-visual systems; face recognition; protocols; speaker recognition; Lausanne protocols; XM2VTS corpus; audio-visual person recognition; face recognition system; multimodal person recognition architecture; speaker recognition architecture; verification protocols; Engines; Face detection; Face recognition; Feature extraction; Linear discriminant analysis; Protocols; Speaker recognition; Speech enhancement; Target recognition; Testing; Multimodal Person Recognition
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.366893
Filename :
4218081