DocumentCode :
3405584
Title :
Audio informed visual speaker tracking with SMC-PHD filter
Author :
Kilic, Volkan ; Barnard, Mark ; Wenwu Wang ; Hilton, Adrian ; Kittler, Josef
Author_Institution :
Centre for Vision, Speech & Signal Process., Univ. of Surrey, Guildford, UK
fYear :
2015
fDate :
June 29 2015-July 3 2015
Firstpage :
1
Lastpage :
6
Abstract :
Sequential Monte Carlo probability hypothesis density (SMC-PHD) filter has received much interest in the field of nonlinear non-Gaussian visual tracking due to its ability to handle a variable number of speakers. The SMC-PHD filter employs surviving, spawned and born particles to model the state of the speakers and jointly estimates the variable number of speakers with their states. The born particles play a critical role in the detection of new speakers, which makes it necessary to propagate them in each frame. However, this increases the computational cost of the visual tracker. Here, we propose to use audio data to determine when to propagate the born particles and re-allocate the surviving and spawned particles. In our framework, we employ audio data as an aid to visual SMC-PHD (V-SMC-PHD) filter by using the direction of arrival (DOA) angles of the audio sources to reshape the distribution of the particles. Experimental results on the AV16:3 dataset with multi-speaker sequences show that our proposed audio-visual SMC-PHD (AV-SMC-PHD) filter improves the tracking performance in terms of estimation accuracy and computational efficiency.
Keywords :
Monte Carlo methods; audio signal processing; audio-visual systems; filtering theory; object tracking; probability; AV-SMC-PHD filter; DOA angles; V-SMC-PHD filter; audio data; audio informed visual speaker tracking; audio sources; audio-visual SMC-PHD filter; born particles; direction of arrival angles; nonlinear nonGaussian visual tracking; sequential Monte Carlo probability hypothesis density filter; spawned particles; surviving particles; Cameras; Computational efficiency; Direction-of-arrival estimation; Estimation; Image color analysis; Measurement; Visualization; Audio-visual tracking; PHD filter; SMC implementation; multi-speaker tracking;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo (ICME), 2015 IEEE International Conference on
Conference_Location :
Turin
Type :
conf
DOI :
10.1109/ICME.2015.7177497
Filename :
7177497
Link To Document :
بازگشت