DocumentCode
2512155
Title
Multimodal Human Computer Interaction with MIDAS Intelligent Infokiosk
Author
Karpov, Alexey ; Ronzhin, Andrey ; Kipyatkova, Irina ; Ronzhin, Alexander ; Akarun, Lale
Author_Institution
St. Petersburg Inst. for Inf. & Autom., RAS, St. Petersburg, Russia
fYear
2010
fDate
23-26 Aug. 2010
Firstpage
3862
Lastpage
3865
Abstract
In this paper, we present an intelligent information kiosk called MIDAS (Multimodal Interactive-Dialogue Automaton for Self-service), including its hardware and software architecture, stages of deployment of speech recognition and synthesis technologies. MIDAS uses the methodology Wizard of Oz (WOZ) that allows an expert to correct speech recognition results and control the dialogue flow. User statistics of the multimodal human computer interaction (HCI) have been analyzed for the operation of the kiosk in the automatic and automated modes. The infokiosk offers information about the structure and staff of laboratories, the location and phones of departments and employees of the institution. The multimodal user interface is provided with a touch screen, natural speech input and head and manual gestures, both for ordinary and physically handicapped users.
Keywords
human computer interaction; interactive systems; software architecture; speech recognition; speech synthesis; speech-based user interfaces; touch sensitive screens; HCI; MIDAS intelligent infokiosk; WOZ; Wizard of Oz; dialogue flow; hardware architecture; head gestures; intelligent information kiosk; manual gestures; multimodal human computer interaction; multimodal interactive-dialogue automaton for self-service; multimodal user interface; natural speech input; ordinary users; physically handicapped users; software architecture; speech recognition; speech synthesis technology; touch screen; Data models; Grammar; Hidden Markov models; Human computer interaction; Laboratories; Speech; Speech recognition; artificial intelligence; automatic speech recognition; human-computer interaction; infokiosk; multimodal user interfaces; speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location
Istanbul
ISSN
1051-4651
Print_ISBN
978-1-4244-7542-1
Type
conf
DOI
10.1109/ICPR.2010.941
Filename
5597644
Link To Document