DocumentCode :
2953645
Title :
Prediction-Based Gesture Detection in Lecture Videos by Combining Visual, Speech and Electronic Slides
Author :
Wang, Feng ; Ngo, Chong-Wah ; Pong, Ting-Chuen
Author_Institution :
Dept. of Comput. Sci., Hong Kong Univ. of Sci. & Technol., Kowloon
fYear :
2006
fDate :
9-12 July 2006
Firstpage :
653
Lastpage :
656
Abstract :
This paper presents an efficient algorithm for gesture detection in lecture videos by combining visual, speech and electronic slides. Besides accuracy, response time is also considered to cope with the efficiency requirements of real-time applications. Candidate gestures are first detected by visual cue. Then we modify HMM models for complete gestures to predict and recognize incomplete gestures before the whole gestures paths are observed. Gesture recognition is used to verify the results of gesture detection. The relations between visual, speech and slides are analyzed. The correspondence between speech and gesture is employed to improve the accuracy and the responsiveness of gesture detection
Keywords :
computer aided instruction; gesture recognition; hidden Markov models; speech recognition; video signal processing; HMM model; candidate gesture; electronic slides; gesture recognition; lecture video; prediction-based gesture detection; real-time application; speech slides; visual cue detection; visual slides; Application software; Cameras; Computer science; Delay; Electronic learning; Hidden Markov models; Multimedia communication; Predictive models; Speech analysis; Videos;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2006 IEEE International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
1-4244-0366-7
Electronic_ISBN :
1-4244-0367-7
Type :
conf
DOI :
10.1109/ICME.2006.262530
Filename :
4036684
Link To Document :
بازگشت