DocumentCode :
3404817
Title :
Tracking a varying number of speakers using particle filtering
Author :
Quinlan, A. ; Asano, F.
Author_Institution :
Inf. Technol. Res. Inst., AIST, Tsukuba
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
297
Lastpage :
300
Abstract :
The extension of particle filtering techniques to the multiple speaker case is difficult as two distinct problems must now be addressed. Firstly, the active speakers must be identified and their locations estimated, requiring the use of multi-dimensional likelihoods, and then each speaker must be correctly associated with his corresponding location. In this paper we propose a multi-speaker tracking algorithm in which the number of active speakers is determined by estimating the profile of the noise-plus-reverberation covariance matrix eigenvalues. The multi-dimensional likelihoods are then decoupled using the Expectation Maximization (EM) algorithm. The tracking accuracy is improved by the inclusion of a pause detection step and estimation of the noise-plus-interference covariance matrix. The results show the benefits of the proposed methods under difficult tracking situations.
Keywords :
covariance matrices; microphone arrays; particle filtering (numerical methods); speech processing; active speakers; expectation maximization algorithm; multi-dimensional likelihoods; multi-speaker tracking algorithm; noise-plus-interference covariance matrix; noise-plus-reverberation covariance matrix eigenvalues; particle filtering techniques; Acoustic noise; Background noise; Covariance matrix; Eigenvalues and eigenfunctions; Filtering algorithms; Frequency; Information filtering; Information filters; Microphone arrays; Particle tracking; Microphone arrays; Noise-plus-reverberation covariance matrix estimation; Particle filtering algorithms; Source number estimation; multiple source tracking;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Type :
conf
DOI :
10.1109/ICASSP.2008.4517605
Filename :
4517605