DocumentCode
433016
Title
Discriminative lip-motion features for biometric speaker identification
Author
Cetingul, H.E. ; Yemez, Y. ; Erzin, E. ; Tekalp, A. Murat
Author_Institution
Coll. of Eng., Koc Univ., Istanbul, Turkey
Volume
3
fYear
2004
fDate
24-27 Oct. 2004
Firstpage
2023
Abstract
This paper addresses the selection of best lip motion features for biometric open-set speaker identification. The best features are those that result in the highest discrimination of individual speakers in a population. We first detect the face region in each video frame. The lip region for each frame is then segmented following registration of successive face regions by global motion compensation. The initial lip feature vector is composed of the 2D-DCT coefficients of the optical flow vectors within the lip region at each frame. The discriminant analysis is composed of two stages. At the first stage, the most discriminative features are selected from the full set of DCT coefficients of a single lip motion frame by using a probabilistic measure that maximizes the ratio of intra-class and inter-class probabilities. At the second stage, the resulting discriminative feature vectors are interpolated and concatenated for each time instant within a neighborhood, and further analyzed by LDA to reduce dimension, this time taking into account temporal discrimination information. Experimental results of the HMM-based speaker identification system are included to demonstrate the performance.
Keywords
biometrics (access control); discrete cosine transforms; hidden Markov models; image registration; image segmentation; motion compensation; probability; speaker recognition; video signal processing; 2D-DCT coefficient; HMM-based speaker identification system; biometric speaker identification; discriminant analysis; discriminative lip-motion feature; face region; interclass probability; intraclass probability; motion compensation; optical flow vector; probabilistic measure; successive face region registration; successive face region segmentation; video frame; Biomedical optical imaging; Biometrics; Concatenated codes; Discrete cosine transforms; Face detection; Image motion analysis; Information analysis; Linear discriminant analysis; Motion compensation; Motion measurement;
fLanguage
English
Publisher
ieee
Conference_Titel
Image Processing, 2004. ICIP '04. 2004 International Conference on
ISSN
1522-4880
Print_ISBN
0-7803-8554-3
Type
conf
DOI
10.1109/ICIP.2004.1421480
Filename
1421480
Link To Document