• DocumentCode
    3528851
  • Title

    Multi-feature audio-visual person recognition

  • Author

    Das, Amitav ; Manyam, Ohil K. ; Tapaswi, Makarand

  • Author_Institution
    Microsoft Res. India, Bangalore
  • fYear
    2008
  • fDate
    16-19 Oct. 2008
  • Firstpage
    227
  • Lastpage
    232
  • Abstract
    We propose a high-performance low-complexity audio-visual person recognition framework suitable for on-line user authentication for various web-applications which delivers robustness against various types of imposter attacks by capturing face and speech dynamics from the video of the user. Instead of using the traditional frontal-face image, a set of compressed face profile vectors are extracted from multiple poses of the person. Similarly, multiple user-selected passwords are used to create robustness against imposter attacks. A novel FGRAM-CFD speech feature is proposed which captures the identity of the user from the speech dynamics contained in the password. The novel signal processing methods proposed here for speech and face feature-extraction led to high discriminative power of the combined audio-visual features. This allowed the classifier to remain simple, yet delivering a reasonably high performance at significantly low complexity as demonstrated by our trials on a 210-user audio-visual biometric database created for this research.
  • Keywords
    audio-visual systems; face recognition; feature extraction; signal processing; speech recognition; audio-visual biometric database; audio-visual features; audio-visual person recognition framework; face feature-extraction; face profile vectors; frontal-face image; imposter attacks; multi-feature audio-visual person recognition; multiple user-selected passwords; on-line user authentication; signal processing; speech feature-extraction; Authentication; Biomedical signal processing; Biometrics; Face recognition; Image coding; Robustness; Spatial databases; Speech processing; Speech recognition; Video compression;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning for Signal Processing, 2008. MLSP 2008. IEEE Workshop on
  • Conference_Location
    Cancun
  • ISSN
    1551-2541
  • Print_ISBN
    978-1-4244-2375-0
  • Electronic_ISBN
    1551-2541
  • Type

    conf

  • DOI
    10.1109/MLSP.2008.4685484
  • Filename
    4685484