Title :
Discriminative lip-motion features for biometric speaker identification
Author :
Cetingul, H.E. ; Yemez, Y. ; Erzin, E. ; Tekalp, A. Murat
Author_Institution :
Coll. of Eng., Koc Univ., Istanbul, Turkey
Abstract :
This paper addresses the selection of best lip motion features for biometric open-set speaker identification. The best features are those that result in the highest discrimination of individual speakers in a population. We first detect the face region in each video frame. The lip region for each frame is then segmented following registration of successive face regions by global motion compensation. The initial lip feature vector is composed of the 2D-DCT coefficients of the optical flow vectors within the lip region at each frame. The discriminant analysis is composed of two stages. At the first stage, the most discriminative features are selected from the full set of DCT coefficients of a single lip motion frame by using a probabilistic measure that maximizes the ratio of intra-class and inter-class probabilities. At the second stage, the resulting discriminative feature vectors are interpolated and concatenated for each time instant within a neighborhood, and further analyzed by LDA to reduce dimension, this time taking into account temporal discrimination information. Experimental results of the HMM-based speaker identification system are included to demonstrate the performance.
Keywords :
biometrics (access control); discrete cosine transforms; hidden Markov models; image registration; image segmentation; motion compensation; probability; speaker recognition; video signal processing; 2D-DCT coefficient; HMM-based speaker identification system; biometric speaker identification; discriminant analysis; discriminative lip-motion feature; face region; interclass probability; intraclass probability; motion compensation; optical flow vector; probabilistic measure; successive face region registration; successive face region segmentation; video frame; Biomedical optical imaging; Biometrics; Concatenated codes; Discrete cosine transforms; Face detection; Image motion analysis; Information analysis; Linear discriminant analysis; Motion compensation; Motion measurement;
Conference_Titel :
Image Processing, 2004. ICIP '04. 2004 International Conference on
Print_ISBN :
0-7803-8554-3
DOI :
10.1109/ICIP.2004.1421480