Title :
Evaluating integrated speech- and image understanding
Author :
Bauckhage, C. ; Fritsch, J. ; Rohlfing, K.J. ; Wachsmuth, S. ; Sagerer, G.
Author_Institution :
Tech. Fac., Bielefeld Univ., Germany
Abstract :
The capability to coordinate and interrelate speech and vision is a virtual prerequisite for adaptive, cooperative, and flexible interaction among people. It is therefore fair to assume that human-machine interaction, too, would benefit from intelligent interfaces for integrated speech and image processing. We first sketch an interactive system that integrates automatic speech processing with image understanding. Then, we concentrate on performance assessment, which we believe is an emerging key issue in multimodal interaction. We explain the benefit of time scale analysis and usability studies and evaluate our system accordingly.
Keywords :
human factors; image recognition; interactive systems; performance evaluation; speech recognition; user interfaces; flexible interaction; human-machine interaction; image processing; image understanding; integrated speech-image understanding; intelligent interfaces; interactive system; multimodal interaction; performance assessment; speech processing; time scale analysis; usability; vision; Assembly systems; Intelligent robots; Man machine systems; Robustness
Conference_Title :
Multimodal Interfaces, 2002. Proceedings. Fourth IEEE International Conference on
Print_ISBN :
0-7695-1834-6
DOI :
10.1109/ICMI.2002.1166961