DocumentCode
263358
Title
Audio-visual tracking of a variable number of speakers with a random finite set approach
Author
Kilic, Volkan ; Xionghu Zhong ; Barnard, Mark ; Wenwu Wang ; Kittler, Josef
Author_Institution
Dept. of Electron. Eng., Univ. of Surrey, Guildford, UK
fYear
2014
fDate
7-10 July 2014
Firstpage
1
Lastpage
7
Abstract
Speaker tracking in smart environments has attracted an increasing amount of attention in the past few years. Our recent studies show that fusing audio and visual modalities can provide improved robustness and accuracy in some challenging tracking scenarios such as occlusions (by the limited field of view of cameras or by other speakers), as compared with the tracking system based on individual modalities. In these previous works, however, the number of speakers is assumed to be known and remains fixed over the tracking process. In this paper, we focus on a more realistic and complex scenario where the number of speakers is unknown and variable with time. We extend the random finite set (RFS) theory for multi-modal data and devise a particle filter algorithm under the RFS framework for audiovisual (AV) tracking. The experiments on the AV16.3 dataset show the capability of our proposed algorithm for tracking both the number of speakers and the positions of the speakers in challenging scenarios such as occlusions.
Keywords
particle filtering (numerical methods); set theory; speaker recognition; target tracking; video signal processing; AV16.3 dataset; RFS framework; RFS theory; audio modalities; audio-visual tracking system; multimodal data; particle filter algorithm; random finite set approach; smart environments; speaker tracking process; visual modalities; Bayes methods; Cameras; Estimation; Histograms; Image color analysis; Time measurement; Visualization; Audio-visual speaker tracking; random finite set;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Fusion (FUSION), 2014 17th International Conference on
Conference_Location
Salamanca
Type
conf
Filename
6916295
Link To Document