DocumentCode
3530295
Title
Robust two-channel TDOA estimation for multiple speaker localization by using recursive ICA and a state coherence transform
Author
Nesta, F. ; Svaizer, P. ; Omologo, M.
Author_Institution
Fondazione Bruno Kessler, Trento
fYear
2009
fDate
19-24 April 2009
Firstpage
4597
Lastpage
4600
Abstract
A novel method is presented for a robust two channel multiple time difference of arrival (TDOA) estimation for multispeaker localization which can provide satisfactory performance even in highly reverberant environment. The method is based on a recursive frequency-domain independent component analysis (ICA) and on a novel state coherence transform (SCT). Exploiting the phase coherence of the demixing matrices obtained in the ICA stage the SCT is able to generate envelopes with clear peaks in the corresponding maximum-likelihood TDOAs. The SCT envelopes are computed independently in each time-block and accurate multiple TDOAs are estimated by means of a time-frequency sparse representation of the sources. The method has been applied to real data obtained by recording many sources in a room with a reverberation time of 700 ms. Experimental results show that an accurate localization of 7 closely-spaced sources is possible given only few seconds of data even in the case of low SNR. Experiments also show the advantage of using the proposed solution rather than the well-known GCC-PHAT.
Keywords
independent component analysis; matrix algebra; maximum likelihood estimation; recursive estimation; signal representation; speaker recognition; time-frequency analysis; time-of-arrival estimation; transforms; demixing matrix; maximum-likelihood TDOA; multiple speaker localization; recursive independent component analysis; robust two-channel TDOA estimation; state coherence transform; time-frequency sparse representation; Frequency domain analysis; Independent component analysis; Maximum likelihood estimation; Recursive estimation; Reverberation; Robustness; Sparse matrices; State estimation; Time difference of arrival; Time frequency analysis; TDOA estimation; blind source separation (BSS); independent component analysis (ICA); multiple speaker localization;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4960654
Filename
4960654
Link To Document