• DocumentCode
    3143574
  • Title

    Enhanced multidimensional spatial functions for unambiguous localization of multiple sparse acoustic sources

  • Author

    Nesta, Francesco ; Omologo, Maurizio

  • Author_Institution
    Center of Inf. Technol., Fondazione Bruno Kessler-Irst, Trento, Italy
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    213
  • Lastpage
    216
  • Abstract
    The Steered Response Power with PHAT transform (SRP-PHAT) or Global Coherence Field (GCF), has become a standard method for acoustic source localization, thanks to their simplicity, computational inexpensiveness and robustness against mid-high reverberation. However, originally formulated for the single source localization case, it does not apply satisfactorily to the multiple source case. In this paper, we analyze the structure of the spatial function and reshape it according to a generic multidimensional metric. We show that traditional functions are based on the L1 norm which is prone to generate ambiguous locations with high likelihood (i.e. ghosts). A more generic multidimensional kernel based on higher norms and on a partitioned representation of the cross-power spectrum is introduced, which better exploits the source sparseness in the discrete time-frequency domain. Evaluation results over simulated data show that the new spatial functions considerably improve the detection of multiple competing sources in both spatial and multidimensional TDOA domains.
  • Keywords
    acoustic radiators; acoustic signal processing; source separation; PHAT transform; acoustic source localization; discrete time-frequency domain; generic multidimensional kernel; global coherence field; mid-high reverberation; multidimensional spatial functions; multiple sparse acoustic sources; steered response power; unambiguous localization; Coherence; Estimation; Kernel; Microphones; Time frequency analysis; Vectors; TDOA estimation; kernel methods; multidimensional signal processing; multiple speaker localization; sparse sources;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6287855
  • Filename
    6287855