• DocumentCode
    2161920
  • Title
    Speaker authentication using video-based lip information
  • Author
    Goswami, B. ; Chan, C. ; Kittler, J. ; Christmas, W.
  • Author_Institution
    FEPS, Univ. of Surrey, Guildford, UK
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    1908
  • Lastpage
    1911
  • Abstract
    The lip-region can be interpreted as either a genetic or behavioural biometric trait depending on whether static or dynamic information is used. In this paper, we use a texture descriptor called Local Ordinal Contrast Pattern (LOCP) in conjunction with a novel spatiotemporal sampling method called Windowed Three Orthogonal Planes (WTOP) to represent both appearance and dynamics features observed in visual speech. This representation, with standard speaker verification engines, is shown to improve the performance of the lip biometric trait compared to the state-of-the-art. The improvement obtained suggests that there is enough discriminative information in the mouth-region to enable its use as a primary biometric as opposed to a "soft" biometric trait.
  • Keywords
    audio-visual systems; biometrics (access control); spatiotemporal phenomena; speaker recognition; video signal processing; behavioural biometric trait; dynamic video information; genetic biometric trait; lip; local ordinal contrast pattern; spatiotemporal sampling method; speaker authentication; static video information; texture descriptor; visual speech; windowed three orthogonal planes; Databases; Feature extraction; Histograms; Pixel; Spatiotemporal phenomena; Speech; Yttrium; Biometrics; lip; spatiotemporal;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2011.5946880
  • Filename
    5946880