• DocumentCode
    3481279
  • Title

    The use of lip motion for biometric speaker identification

  • Author

    Çetingül, H.E. ; Yemez, Y. ; Erzin, E. ; Tekalp, A.M.

  • Author_Institution
    Koc Univ., Istanbul, Turkey
  • fYear
    2004
  • fDate
    28-30 April 2004
  • Firstpage
    148
  • Lastpage
    151
  • Abstract
    The paper addresses the selection of the best lip motion features for biometric open-set speaker identification. The best features are those that result in the highest discrimination of individual speakers in a population. We first detect the face region in each video frame. The lip region for each frame is then segmented following registration of successive face regions by global motion compensation. The initial lip feature vector is composed of the 2D-DCT coefficients of the optical flow vectors within the lip region at each frame. We propose to select the most discriminative features from the full set of transform coefficients by using a probabilistic measure that maximizes the ratio of intra-class and inter-class probabilities. The resulting discriminative feature vector with reduced dimension is expected to maximize the identification performance. Experimental results support that the resulting discriminative feature vector with reduced dimension improves the identification performance.
  • Keywords
    biometrics (access control); discrete cosine transforms; feature extraction; image registration; image segmentation; image sequences; motion compensation; optimisation; probability; speaker recognition; video signal processing; 2D-DCT coefficients; biometric speaker identification; discriminative feature vector; face region detection; face region registration; global motion compensation; inter-class probability; intra-class probability; lip feature vector; lip motion features; open-set speaker identification; optical flow vectors; probabilistic measure; transform coefficients; video frame segmentation; Biomedical optical imaging; Biometrics; Face detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Communications Applications Conference, 2004. Proceedings of the IEEE 12th
  • Print_ISBN
    0-7803-8318-4
  • Type

    conf

  • DOI
    10.1109/SIU.2004.1338280
  • Filename
    1338280