DocumentCode
2995468
Title
Building an application framework for speech and pen input integration in multimodal learning interfaces
Author
Vo, Minh Tue ; Wood, Cindy
Author_Institution
Interactive Syst. Labs., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume
6
fYear
1996
fDate
7-10 May 1996
Firstpage
3545
Abstract
While significant advances have been made to improve speech recognition performance, as well as gesture and handwriting recognition, speech- and pen-based systems have still not found broad acceptance in everyday life. One reason for this is the inflexibility of each input modality when used alone. Human communication is very natural and flexible because we can take advantage of a multiplicity of communication signals working in concert to supply complementary information or increase robustness with redundancy. We present a multimodal interface capable of jointly interpreting speech, pen-based gestures, and handwriting in the context of an appointment scheduling application. The interpretation engine, based on semantic frame merging, correctly interprets 80% of a multimodal data set assuming perfect speech and gesture/handwriting recognition; in the presence of recognition errors, the interpretation performance is in the range of 35-62%. A dialog processing scheme uses task domain knowledge to guide the user in supplying information and permits human-computer interactions to span several related multimodal input events.
Keywords
character recognition; graphical user interfaces; learning (artificial intelligence); natural language interfaces; scheduling; speech recognition; application framework; appointment scheduling application; dialog processing scheme; handwriting; handwriting recognition; human-computer interactions; input modality; interpretation performance; multimodal learning interfaces; pen input integration; semantic frame merging; speech input integration; speech recognition; task domain knowledge; Calendars; Engines; Error correction; Handwriting recognition; Interactive systems; Laboratories; Merging; Personal digital assistants; Speech recognition; Writing;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location
Atlanta, GA
ISSN
1520-6149
Print_ISBN
0-7803-3192-3
Type
conf
DOI
10.1109/ICASSP.1996.550794
Filename
550794