Title :
Probabilistic integration of audiovisual information to localize sound source in human-robot interaction
Author :
Chen, Bin ; Meguro, Mitsuhiko ; Kaneko, Masahide
Author_Institution :
Dept. of Electron. Eng., Univ. of Electro-Commun., Tokyo, Japan
fDate :
31 Oct.-2 Nov. 2003
Abstract :
This paper proposes a method to estimate a sound source position by fusing the auditory and visual information with Bayesian network in human-robot interaction. We firstly integrate multi-channel audio signals and a depth image about the environment to generate a likelihood map for sound source localization. However, this integration, denoted by "MICs", does not always lead to locate a sound source correctly. For correcting the failure in localization, we integrate the likelihood values generated from "MICs" and the skin-color distribution in an image according to the result of classifying audio signal into speech/non-speech categories. The audio classifier is based on the support vector machine(SVM) and the skin-color distribution is modeled with GMM. With the evidences given by MICs, SVMs and GMM, we infer whether pixels in images correspond to sound source or not according to the trained Bayesian network. Finally, experimental results are presented to show the effectiveness of the proposed method.
Keywords :
acoustic generators; audio signal processing; audio-visual systems; belief networks; imaging; man-machine systems; probability; robots; speech; support vector machines; Bayesian network; audio classifier; audiovisual information; human-robot interaction; image pixels; likelihood map; multichannel audio signals; nonspeech categories; probabilistic integration; skin-color distribution; sound source localization; sound source position estimation; support vector machine; Acoustic noise; Bayesian methods; Cameras; Color; Face detection; Intelligent robots; Loudspeakers; Microphone arrays; Robot vision systems; Signal processing;
Conference_Titel :
Robot and Human Interactive Communication, 2003. Proceedings. ROMAN 2003. The 12th IEEE International Workshop on
Print_ISBN :
0-7803-8136-X
DOI :
10.1109/ROMAN.2003.1251850