• DocumentCode
    2512155
  • Title

    Multimodal Human Computer Interaction with MIDAS Intelligent Infokiosk

  • Author

    Karpov, Alexey ; Ronzhin, Andrey ; Kipyatkova, Irina ; Ronzhin, Alexander ; Akarun, Lale

  • Author_Institution
    St. Petersburg Inst. for Inf. & Autom., RAS, St. Petersburg, Russia
  • fYear
    2010
  • fDate
    23-26 Aug. 2010
  • Firstpage
    3862
  • Lastpage
    3865
  • Abstract
    In this paper, we present an intelligent information kiosk called MIDAS (Multimodal Interactive-Dialogue Automaton for Self-service), including its hardware and software architecture, stages of deployment of speech recognition and synthesis technologies. MIDAS uses the methodology Wizard of Oz (WOZ) that allows an expert to correct speech recognition results and control the dialogue flow. User statistics of the multimodal human computer interaction (HCI) have been analyzed for the operation of the kiosk in the automatic and automated modes. The infokiosk offers information about the structure and staff of laboratories, the location and phones of departments and employees of the institution. The multimodal user interface is provided with a touch screen, natural speech input and head and manual gestures, both for ordinary and physically handicapped users.
  • Keywords
    human computer interaction; interactive systems; software architecture; speech recognition; speech synthesis; speech-based user interfaces; touch sensitive screens; HCI; MIDAS intelligent infokiosk; WOZ; Wizard of Oz; dialogue flow; hardware architecture; head gestures; intelligent information kiosk; manual gestures; multimodal human computer interaction; multimodal interactive-dialogue automaton for self-service; multimodal user interface; natural speech input; ordinary users; physically handicapped users; software architecture; speech recognition; speech synthesis technology; touch screen; Data models; Grammar; Hidden Markov models; Human computer interaction; Laboratories; Speech; Speech recognition; artificial intelligence; automatic speech recognition; human-computer interaction; infokiosk; multimodal user interfaces; speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2010 20th International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-7542-1
  • Type

    conf

  • DOI
    10.1109/ICPR.2010.941
  • Filename
    5597644