Title :
Multimodal saliency-based attention for object-based scene analysis
Author :
Schauerte, Boris ; Kühn, Benjamin ; Kroschel, Kristian ; Stiefelhagen, Rainer
Author_Institution :
Institute for Anthropomatics, Karlsruhe Institute of Technology (KIT), Adenauerring 2, 76131, Germany
Abstract :
Multimodal attention is a key requirement for humanoid robots in order to navigate in complex environments and act as social, cognitive human partners. To this end, robots have to incorporate attention mechanisms that focus the processing on the potentially most relevant stimuli while controlling the sensor orientation to improve the perception of these stimuli. In this paper, we present our implementation of audio-visual saliency-based attention that we integrated in a system for knowledge-driven audio-visual scene analysis and object-based world modeling. For this purpose, we introduce a novel isophote-based method for proto-object segmentation of saliency maps, a surprise-based auditory saliency definition, and a parametric 3-D model for multimodal saliency fusion. The applicability of the proposed system is demonstrated in a series of experiments.
Keywords :
Cameras; Computational modeling; Microphones; Robot sensing systems; Solid modeling; Visualization; active perception; audio-visual saliency; auditory surprise; hierarchical object analysis; isophote-based visual proto-objects; multimodal attention; object-based inhibition of return; overt attention; parametric 3-D saliency model; scene exploration;
Conference_Titel :
Intelligent Robots and Systems (IROS), 2011 IEEE/RSJ International Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
978-1-61284-454-1
DOI :
10.1109/IROS.2011.6095124