DocumentCode :
263358
Title :
Audio-visual tracking of a variable number of speakers with a random finite set approach
Author :
Kilic, Volkan ; Xionghu Zhong ; Barnard, Mark ; Wenwu Wang ; Kittler, Josef
Author_Institution :
Dept. of Electron. Eng., Univ. of Surrey, Guildford, UK
fYear :
2014
fDate :
7-10 July 2014
Firstpage :
1
Lastpage :
7
Abstract :
Speaker tracking in smart environments has attracted an increasing amount of attention in the past few years. Our recent studies show that fusing audio and visual modalities can provide improved robustness and accuracy in some challenging tracking scenarios such as occlusions (by the limited field of view of cameras or by other speakers), as compared with the tracking system based on individual modalities. In these previous works, however, the number of speakers is assumed to be known and remains fixed over the tracking process. In this paper, we focus on a more realistic and complex scenario where the number of speakers is unknown and variable with time. We extend the random finite set (RFS) theory for multi-modal data and devise a particle filter algorithm under the RFS framework for audiovisual (AV) tracking. The experiments on the AV16.3 dataset show the capability of our proposed algorithm for tracking both the number of speakers and the positions of the speakers in challenging scenarios such as occlusions.
Keywords :
particle filtering (numerical methods); set theory; speaker recognition; target tracking; video signal processing; AV16.3 dataset; RFS framework; RFS theory; audio modalities; audio-visual tracking system; multimodal data; particle filter algorithm; random finite set approach; smart environments; speaker tracking process; visual modalities; Bayes methods; Cameras; Estimation; Histograms; Image color analysis; Time measurement; Visualization; Audio-visual speaker tracking; random finite set;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Fusion (FUSION), 2014 17th International Conference on
Conference_Location :
Salamanca
Type :
conf
Filename :
6916295
Link To Document :
بازگشت