• DocumentCode
    2376920
  • Title

    Designing a robust speech and gaze multimodal system for diverse users

  • Author

    Zhang, Qiaohui ; Go, Kentaro ; Imamiya, Atsumi ; Mao, Xiaoyang

  • Author_Institution
    Dept. of Comput. & Media Eng., Yamanashi Univ., Kofu, Japan
  • fYear
    2003
  • fDate
    27-29 Oct. 2003
  • Firstpage
    354
  • Lastpage
    361
  • Abstract
    The recognition errors make recognition-based systems brittle, and lead to usability problems. Multimodal system is generally believed as an effective means of being able to contribute to error avoidance and recovery. This work explores how to combine gaze and speech, which are two error-prone modes, in order to get a robust multimodal architecture. Combining the two overcomes imperfections of recognition techniques, compensates for drawbacks of a single mode, resolves the language ambiguity, and leads to a much more effective system. In addition, we try to employ a new performance criterion about the error-handling ability to analyze and assess the multimodal integration strategies. With this new measure approach, not only the benefits of mutual disambiguation of individual input signals within the multimodal architecture are demonstrated, but also the condition under which the multimodal system becomes the most effective is identified.
  • Keywords
    error handling; human computer interaction; image recognition; speech recognition; error avoidance; error recovery; error-handling ability; error-prone mode; eye tracking; gaze multimodal system; human computer interaction; multimodal integration; recognition error; recognition-based system; robust multimodal architecture; robust speech system; speech input; speech multimodal system; Computer architecture; Computer errors; Computer interfaces; Design engineering; Error analysis; Error correction; Mice; Performance analysis; Robustness; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Reuse and Integration, 2003. IRI 2003. IEEE International Conference on
  • Print_ISBN
    0-7803-8242-0
  • Type

    conf

  • DOI
    10.1109/IRI.2003.1251437
  • Filename
    1251437