• DocumentCode
    2701233
  • Title

    An Evaluation of Audio-Visual Person Recognition on the XM2VTS Corpus using the Lausanne Protocols

  • Author

    Brady, K. ; Brandstein, M. ; Quatieri, T. ; Dunn, B.

  • Author_Institution
    MIT Lincoln Lab., Lexington, MA, USA
  • Volume
    4
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    A multimodal person recognition architecture has been developed for the purpose of improving overall recognition performance and for addressing channel-specific performance shortfalls. This multimodal architecture includes the fusion of a face recognition system with the MIT/LL GMM/UBM speaker recognition architecture. This architecture exploits the complementary and redundant nature of the face and speech modalities. The resulting multimodal architecture has been evaluated on the XM2VTS corpus using the Lausanne open set verification protocols, and demonstrates excellent recognition performance. The multimodal architecture also exhibits strong recognition performance gains over the performance of the individual modalities.
  • Keywords
    audio-visual systems; face recognition; protocols; speaker recognition; Lausanne protocols; XM2VTS corpus; audio-visual person recognition; face recognition system; multimodal person recognition architecture; speaker recognition architecture; verification protocols; Engines; Face detection; Face recognition; Feature extraction; Linear discriminant analysis; Protocols; Speaker recognition; Speech enhancement; Target recognition; Testing; Multimodal Person Recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.366893
  • Filename
    4218081