Title :
Visual lip contour detection for the purpose of speech recognition
Author :
Dalka, Piotr ; Bratoszewski, Piotr ; Czyzewski, Andrzej
Author_Institution :
Multimedia Syst. Dept., Gdansk Univ. of Technol., Gdansk, Poland
Abstract :
A method for the visual detection of lip contours in frontal recordings of speakers is described and evaluated. The purpose of the method is to facilitate speech recognition using visual features extracted from the mouth region. Different Active Appearance Models (AAMs) are employed for finding lips in video frames and for the statistical description of lip shape and texture. A search initialization procedure is proposed, and error measure values are monitored to prevent the matching process from converging to a false local minimum. AAM-based visual features are applied in an experiment devoted to the static recognition of English vowels with an SVM. The studies use a database of recordings of 5 speakers with different skin colors. Results are discussed and illustrated with figures.
Keywords :
feature extraction; image matching; image texture; speaker recognition; statistical analysis; support vector machines; AAM based visual feature extraction; English vowel static recognition; SVM; active appearance model; error measure value monitoring; lip shape; matching process; mouth region; search initialization procedure; skin color; speaker recording; speech recognition; texture statistical description; video frame; visual lip contour detection; Accuracy; Active appearance model; Feature extraction; Mouth; Shape; Speech recognition; Visualization; Active Appearance Models; SVM; lip localization; vowel recognition;
Conference_Title :
Signals and Electronic Systems (ICSES), 2014 International Conference on
Conference_Location :
Poznan
DOI :
10.1109/ICSES.2014.6948716