DocumentCode :
3315699
Title :
A vision-based microphone switch for speech intent detection
Author :
Iyengar, Giridharan ; Neti, Chalapathy
Author_Institution :
Human Language Technol., IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
fYear :
2001
fDate :
2001
Firstpage :
101
Lastpage :
105
Abstract :
We present our system for speech intent detection. In traditional desktop speech applications, the user has to explicitly indicate intent-to-speak to the computer by turning the microphone on. This is to alleviate problems associated with an open microphone in an automatic speech recognition system. In this paper, we use cues derived from user pose, proximity and visual speech activity to detect speech intent and enable automatic control of the microphone. We achieve real-time performance using pre-attentive cues to eliminate redundant computation.
Keywords :
computer vision; gesture recognition; interactive systems; real-time systems; speech recognition; user interfaces; automatic speech recognition; speech intent detection; user pose recognition; vision-based microphone; Application software; Face detection; Humans; Lips; Microphones; Natural languages; Switches; Telephony; Turning
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems, 2001. Proceedings. IEEE ICCV Workshop on
Conference_Location :
Vancouver, BC
ISSN :
1530-1044
Print_ISBN :
0-7695-1074-4
Type :
conf
DOI :
10.1109/RATFG.2001.938917
Filename :
938917