Title :
Robust multimodal understanding
Author :
Bangalore, Srinivas ; Johnston, Michael
Author_Institution :
AT&T Labs.-Res., USA
Abstract :
Contemporary multimodal prototypes provide an excellent proof of concept but are not sufficiently robust in their handling of user input to be adopted by real users engaged in complex tasks. Our goal is to investigate techniques that improve the robustness of multimodal understanding to the point where effective multimodal interfaces can be made feasible. We present two different approaches - a pattern-matching based approach and a classification-based approach to robust multimodal understanding. We compare these approaches by evaluating them on data collected in the context of a multimodal conversational system.
Keywords :
graphical user interfaces; interactive systems; natural language interfaces; pattern classification; pattern matching; speech processing; speech-based user interfaces; classification; conversational system; graphical interaction; multimodal interfaces; pattern-matching; robust multimodal understanding; spoken interaction; Calendars; Cities and towns; Displays; Lattices; Navigation; Personnel; Prototypes; Robustness; Speech recognition; Working environment noise;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326011