DocumentCode
2701233
Title
An Evaluation of Audio-Visual Person Recognition on the XM2VTS Corpus using the Lausanne Protocols
Author
Brady, K. ; Brandstein, M. ; Quatieri, T. ; Dunn, B.
Author_Institution
MIT Lincoln Lab., Lexington, MA, USA
Volume
4
fYear
2007
fDate
15-20 April 2007
Abstract
A multimodal person recognition architecture has been developed for the purpose of improving overall recognition performance and for addressing channel-specific performance shortfalls. This multimodal architecture includes the fusion of a face recognition system with the MIT/LL GMM/UBM speaker recognition architecture. This architecture exploits the complementary and redundant nature of the face and speech modalities. The resulting multimodal architecture has been evaluated on the XM2VTS corpus using the Lausanne open set verification protocols, and demonstrates excellent recognition performance. The multimodal architecture also exhibits strong recognition performance gains over the performance of the individual modalities.
Keywords
audio-visual systems; face recognition; protocols; speaker recognition; Lausanne protocols; XM2VTS corpus; audio-visual person recognition; face recognition system; multimodal person recognition architecture; speaker recognition architecture; verification protocols; Engines; Face detection; Face recognition; Feature extraction; Linear discriminant analysis; Protocols; Speaker recognition; Speech enhancement; Target recognition; Testing; Multimodal Person Recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location
Honolulu, HI
ISSN
1520-6149
Print_ISBN
1-4244-0727-3
Type
conf
DOI
10.1109/ICASSP.2007.366893
Filename
4218081
Link To Document