DocumentCode :
3530295
Title :
Robust two-channel TDOA estimation for multiple speaker localization by using recursive ICA and a state coherence transform
Author :
Nesta, F. ; Svaizer, P. ; Omologo, M.
Author_Institution :
Fondazione Bruno Kessler, Trento
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
4597
Lastpage :
4600
Abstract :
A novel method is presented for robust two-channel estimation of multiple time differences of arrival (TDOAs) for multispeaker localization, which provides satisfactory performance even in highly reverberant environments. The method is based on a recursive frequency-domain independent component analysis (ICA) and on a novel state coherence transform (SCT). By exploiting the phase coherence of the demixing matrices obtained in the ICA stage, the SCT generates envelopes with clear peaks at the corresponding maximum-likelihood TDOAs. The SCT envelopes are computed independently in each time block, and accurate multiple TDOAs are estimated by means of a time-frequency sparse representation of the sources. The method has been applied to real data obtained by recording many sources in a room with a reverberation time of 700 ms. Experimental results show that accurate localization of 7 closely spaced sources is possible given only a few seconds of data, even in the case of low SNR. Experiments also show the advantage of the proposed solution over the well-known GCC-PHAT.
Keywords :
independent component analysis; matrix algebra; maximum likelihood estimation; recursive estimation; signal representation; speaker recognition; time-frequency analysis; time-of-arrival estimation; transforms; demixing matrix; maximum-likelihood TDOA; multiple speaker localization; recursive independent component analysis; robust two-channel TDOA estimation; state coherence transform; time-frequency sparse representation; Frequency domain analysis; Independent component analysis; Maximum likelihood estimation; Recursive estimation; Reverberation; Robustness; Sparse matrices; State estimation; Time difference of arrival; Time frequency analysis; TDOA estimation; blind source separation (BSS); independent component analysis (ICA); multiple speaker localization;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960654
Filename :
4960654