• DocumentCode
    3530295
  • Title

    Robust two-channel TDOA estimation for multiple speaker localization by using recursive ICA and a state coherence transform

  • Author

    Nesta, F. ; Svaizer, P. ; Omologo, M.

  • Author_Institution
    Fondazione Bruno Kessler, Trento
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    4597
  • Lastpage
    4600
  • Abstract
    A novel method is presented for a robust two channel multiple time difference of arrival (TDOA) estimation for multispeaker localization which can provide satisfactory performance even in highly reverberant environment. The method is based on a recursive frequency-domain independent component analysis (ICA) and on a novel state coherence transform (SCT). Exploiting the phase coherence of the demixing matrices obtained in the ICA stage the SCT is able to generate envelopes with clear peaks in the corresponding maximum-likelihood TDOAs. The SCT envelopes are computed independently in each time-block and accurate multiple TDOAs are estimated by means of a time-frequency sparse representation of the sources. The method has been applied to real data obtained by recording many sources in a room with a reverberation time of 700 ms. Experimental results show that an accurate localization of 7 closely-spaced sources is possible given only few seconds of data even in the case of low SNR. Experiments also show the advantage of using the proposed solution rather than the well-known GCC-PHAT.
  • Keywords
    independent component analysis; matrix algebra; maximum likelihood estimation; recursive estimation; signal representation; speaker recognition; time-frequency analysis; time-of-arrival estimation; transforms; demixing matrix; maximum-likelihood TDOA; multiple speaker localization; recursive independent component analysis; robust two-channel TDOA estimation; state coherence transform; time-frequency sparse representation; Frequency domain analysis; Independent component analysis; Maximum likelihood estimation; Recursive estimation; Reverberation; Robustness; Sparse matrices; State estimation; Time difference of arrival; Time frequency analysis; TDOA estimation; blind source separation (BSS); independent component analysis (ICA); multiple speaker localization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960654
  • Filename
    4960654