• DocumentCode
    857487
  • Title

    Robust Biometric Person Identification Using Automatic Classifier Fusion of Speech, Mouth, and Face Experts

  • Author

    Fox, Niall A. ; Gross, Ralph ; Cohn, Jeffrey F. ; Reilly, Richard B.

  • Author_Institution
    MMSP Lab., Univ. Coll. Dublin
  • Volume
    9
  • Issue
    4
  • fYear
    2007
  • fDate
    6/1/2007 12:00:00 AM
  • Firstpage
    701
  • Lastpage
    714
  • Abstract
    Information about person identity is multimodal. Yet, most person-recognition systems limit themselves to only a single modality, such as facial appearance. With a view to exploiting the complementary nature of different modes of information and increasing pattern recognition robustness to test signal degradation, we developed a multiple expert biometric person identification system that combines information from three experts: audio, visual speech, and face. The system uses multimodal fusion in an automatic unsupervised manner, adapting to the local performance (at the transaction level) and output reliability of each of the three experts. The expert weightings are chosen automatically such that the reliability measure of the combined scores is maximized. To test system robustness to train/test mismatch, we used a broad range of acoustic babble noise and JPEG compression to degrade the audio and visual signals, respectively. Identification experiments were carried out on a 248-subject subset of the XM2VTS database. The multimodal expert system outperformed each of the single experts in all comparisons. At severe audio and visual mismatch levels tested, the audio, mouth, face, and tri-expert fusion accuracies were 16.1%, 48%, 75%, and 89.9%, respectively, representing a relative improvement of 19.9% over the best performing expert
  • Keywords
    audio signal processing; biometrics (access control); face recognition; hidden Markov models; image fusion; speaker recognition; unsupervised learning; JPEG compression; acoustic babble noise; audio signal; automatic classifier fusion; face recognition; hidden Markov model; multimodal expert system; multiple expert biometric person identification; pattern recognition; visual speech signal; Biometric fusion; expert reliability; hidden Markov models; image information loss; mouth features; multimodal; person recognition; robustness; tri-expert;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2007.893339
  • Filename
    4202588