DocumentCode :
530116
Title :
Classification of visemes using visual cues
Author :
Alothmany, Nazeeh ; Boston, Robert ; Li, Ching ; Shaiman, Susan ; Durrant, John
Author_Institution :
ECE Dept., King Abdulaziz Univ., Jeddah, Saudi Arabia
fYear :
2010
fDate :
15-17 Sept. 2010
Firstpage :
345
Lastpage :
349
Abstract :
Psycho-acoustic tests have indicated that human vision classifies the visemes (visual representation of phonemes) into different classes. This study shows that visual features extracted from 2-D images of lip motion can be used to design an automatic classifier for visemes. Audio-visual recordings from 18 native speakers of American English for 12 Vowel-Consonant-Vowel (VCV) sounds were obtained using the consonants /b,v,w,õ,d,z/ and the vowels /a,i/. The lip height, lip width, motion of the upper lip and the rate at which lips move while producing the VCV words were visual features used to represent each VCV sound. Features extracted from nine of the speakers were used to define Linear Discriminant Analysis functions to classify the visemes and features extracted from the remaining nine speakers were used in testing the classifiers. When the VCV sounds were divided into five classes consistent with those obtained from psycho-acoustic tests, the percentage of correct classification was 72.1% in training and 69.3% in testing. When each VCV sequence was treated as an independent class, resulting in 12 classes, the percentage of correct recognition was 55.3% in the training set and 43.1% in the testing set.
Keywords :
face recognition; feature extraction; pattern classification; speech recognition; statistical analysis; American English; audio-visual recordings; automatic classifier; linear discriminant analysis functions; lip motion; psycho-acoustic tests; visemes; visual cues; visual features; Feature extraction; Lips; Speech; Speech recognition; Testing; Training; Visualization; component; visemes; visual classification of phonemes; visual phoneme; visual vues;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
ELMAR, 2010 PROCEEDINGS
Conference_Location :
Zadar
ISSN :
1334-2630
Print_ISBN :
978-1-4244-6371-8
Electronic_ISBN :
1334-2630
Type :
conf
Filename :
5606142
Link To Document :
بازگشت