• DocumentCode
    417182
  • Title

    Robust multimodal understanding

  • Author

    Bangalore, Srinivas ; Johnston, Michael

  • Author_Institution
    AT&T Labs.-Res., USA
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    Contemporary multimodal prototypes provide an excellent proof of concept but are not sufficiently robust in their handling of user input to be adopted by real users engaged in complex tasks. Our goal is to investigate techniques that improve the robustness of multimodal understanding to the point where effective multimodal interfaces can be made feasible. We present two different approaches - a pattern-matching based approach and a classification-based approach to robust multimodal understanding. We compare these approaches by evaluating them on data collected in the context of a multimodal conversational system.
  • Keywords
    graphical user interfaces; interactive systems; natural language interfaces; pattern classification; pattern matching; speech processing; speech-based user interfaces; classification; conversational system; graphical interaction; multimodal interfaces; pattern-matching; robust multimodal understanding; spoken interaction; Calendars; Cities and towns; Displays; Lattices; Navigation; Personnel; Prototypes; Robustness; Speech recognition; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326011
  • Filename
    1326011