DocumentCode :
398695
Title :
Audio-visual speaker tracking with importance particle filters
Author :
Perez, Daniel Gatica ; Lathoud, Guillaume ; McCowan, Iain ; Odobez, Jean Marc ; Moore, Darren
Author_Institution :
Dalle Molle Inst. for Perceptual Artificial Intelligence, Switzerland
Volume :
3
fYear :
2003
fDate :
14-17 Sept. 2003
Abstract :
We present a probabilistic method for audio-visual (AV) speaker tracking, using an uncalibrated wide-angle camera and a micro- phone array. The algorithm fuses 2-D object shape and audio information via importance particle filters (I-PFs), allowing for the asymmetrical integration of AV information in a way that efficiently exploits the complementary features of each modality. Audio localization information is used to generate an importance sampling (IS) function, which guides the random search process of a particle filter towards regions of the configuration space likely to contain the true configuration (a speaker). The measurement process integrates contour-based and audio observations, which results in reliable head tracking in realistic scenarios. We show that imperfect single modalities can be combined into an algorithm that automatically initializes and tracks a speaker, switches between multiple speakers, tolerates visual clutter, and recovers from total AV object occlusion, in the context of a multimodal meeting room.
Keywords :
audio-visual systems; filters; importance sampling; probability; speaker recognition; 2D object shape; audio information; audio localization information; audio-visual speaker tracking; importance sampling function; microphone array; multiple speaker switching; object occlusion; particle filters; probabilistic method; random search process; uncalibrated wide-angle camera; visual clutter; Calibration; Cameras; Fuses; Monte Carlo methods; Particle filters; Particle tracking; Robustness; Sampling methods; Signal processing algorithms; Sliding mode control;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image Processing, 2003. ICIP 2003. Proceedings. 2003 International Conference on
ISSN :
1522-4880
Print_ISBN :
0-7803-7750-8
Type :
conf
DOI :
10.1109/ICIP.2003.1247172
Filename :
1247172
Link To Document :
بازگشت