Title :
Mouth motion analysis with space-time interest points
Author :
Hojo, Hiroshi ; Hamada, Nozomu
Author_Institution :
Hamada Lab., Keio Univ., Yokohama, Japan
Abstract :
Speech recognition and speaker detection techniques based on audio-visual information fusion are attracting much attention. On the visual side, namely lip reading, most recent studies analyze the shape of the mouth, whereas few analyze lip motion. However, analysis of mouth motion gives essential cues for understanding utterance mechanics. As a tool for analyzing mouth motion, we therefore focus on space-time interest points (STIP), which have been applied effectively to gait analysis and human action recognition. This study stems from the idea that STIP should also be useful for mouth motion analysis. The proposed mouth motion analysis system using STIP needs neither contour estimation nor feature tracking. Several image processing steps are proposed in order to apply STIP appropriately to mouth motion. Additionally, to evaluate the detected STIPs as a tool for mouth motion analysis, we classify Japanese vowel utterances into three motion types using the detected STIPs.
Keywords :
feature extraction; gait analysis; gesture recognition; image motion analysis; natural language processing; Japanese vowels utterances; audio visual fusion information; contour estimation; feature tracking; gait analysis; human action recognition; image processings; lip reading area; mouth motion analysis; space-time interest points; speaker detection technique; speech recognition; utterance mechanics; visual side information; Humans; Image motion analysis; Information analysis; Motion analysis; Motion detection; Motion estimation; Mouth; Shape; Speech recognition; Tracking;
Conference_Title :
TENCON 2009 - 2009 IEEE Region 10 Conference
Conference_Location :
Singapore
Print_ISBN :
978-1-4244-4546-2
Electronic_ISBN :
978-1-4244-4547-9
DOI :
10.1109/TENCON.2009.5395919