DocumentCode :
3380216
Title :
Toward multimodal interpretation in a natural speech/gesture interface
Author :
Kettebekov, Sanzhar ; Sharma, Rajeev
Author_Institution :
Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
fYear :
1999
fDate :
1999
Firstpage :
328
Lastpage :
335
Abstract :
Hand gestures and speech comprise the most important modalities of human-to-human interaction. Motivated by this, there has been considerable interest in incorporating these modalities for “natural” human-computer interaction (HCI), particularly within virtual environments. An important feature of such a natural interface would be the absence of predefined speech and gesture commands. The resulting bimodal speech/gesture HCI “language” would thus have to be interpreted by the computer. This involves challenges ranging from the low-level signal processing of bimodal (audio/video) input to the high-level interpretation of natural speech/gesture in HCI. This paper identifies the issues of natural (non-prefixed) multimodal HCI interpretation. Because gestures in natural interaction do not exhibit a one-to-one mapping of form to meaning, we specifically address problems associated with vision-based gesture interpretation in a multimodal interface. We consider the design of a speech/gesture interface in the context of a set of spatial tasks defined on a computerized campus map. The task context makes it possible to study the critical components of the multimodal interpretation and integration problem.
Keywords :
gesture recognition; natural language interfaces; virtual reality; bimodal input; bimodal speech/gesture HCI language; computerized campus map; hand gestures; high level interpretation; low-level signal processing; multimodal interpretation; natural human-computer interaction; natural speech/gesture interface; spatial tasks; virtual environments; vision-based gesture interpretation; Augmented reality; Computer interfaces; Computer science; Hidden Markov models; Human computer interaction; Natural languages; Signal processing; Speech; Virtual environment; Virtual reality;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Information Intelligence and Systems, 1999. Proceedings. 1999 International Conference on
Conference_Location :
Bethesda, MD
Print_ISBN :
0-7695-0446-9
Type :
conf
DOI :
10.1109/ICIIS.1999.810285
Filename :
810285