• DocumentCode
    2995468
  • Title

    Building an application framework for speech and pen input integration in multimodal learning interfaces

  • Author

    Vo, Minh Tue ; Wood, Cindy

  • Author_Institution
    Interactive Syst. Labs., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • Volume
    6
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    3545
  • Abstract
    While significant advances have been made improve speech recognition performance, and gesture and handwriting recognition, speech- and pen-based systems have still not found broad acceptance in everyday life. One reason for this is the inflexibility of each input modality when used alone. Human communication is very natural and flexible because we can take advantage of a multiplicity of communication signals working in concert to supply complementary information or increase robustness with redundancy. We present a multimodal interface capable of jointly interpreting speech, pen-based gestures, and handwriting in the context of an appointment scheduling application. The interpretation engine based on semantic frame merging correctly interprets 80% of a multimodal data set assuming perfect speech and gesture/handwriting recognition; in the presence of recognition errors the interpretation performance is in the range of 35-62%. A dialog processing scheme uses task domain knowledge to guide the user in supplying information and permits human-computer interactions to span several related multimodal input events
  • Keywords
    character recognition; graphical user interfaces; learning (artificial intelligence); natural language interfaces; scheduling; speech recognition; application framework; appointment scheduling application; dialog processing scheme; handwriting; handwriting recognition; human-computer interactions; input modality; interpretation performance; multimodal learning interfaces; pen input integration; semantic frame merging; speech input integration; speech recognition; task domain knowledge; Calendars; Engines; Error correction; Handwriting recognition; Interactive systems; Laboratories; Merging; Personal digital assistants; Speech recognition; Writing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.550794
  • Filename
    550794