• DocumentCode
    1670246
  • Title

    A probabilistic framework for multiple speaker localization

  • Author

    Oualil, Youssef ; Magimai-Doss, Mathew ; Faubel, Friedrich ; Klakow, Dietrich

  • Author_Institution
    Spoken Language Syst., Saarland Univ., Saarbrucken, Germany
  • fYear
    2013
  • Firstpage
    3962
  • Lastpage
    3966
  • Abstract
    This paper presents a novel probabilistic framework for localizing multiple speakers with a microphone array. In this framework, the generalized cross correlation function (GCC) of each microphone pair is interpreted as a probability distribution of the time difference of arrival (TDOA) and subsequently approximated as a Gaussian mixture. The distribution parameters are estimated with a weighted expectation maximization algorithm. Then, the joint distribution of the TDOA Gaussian mixtures is mapped to a multimodal distribution in the location space, where each mode represents a potential source location. The approach taken here performs the localization by 1) reducing the search space to some regions that are likely to contain a source and then 2) extracting the actual speaker locations with a numerical optimization algorithm. The effectiveness of the proposed approach is shown using the AV16.3 corpus.
  • Keywords
    microphone arrays; speaker recognition; time-of-arrival estimation; AV16.3 corpus; Gaussian mixtures; generalized cross correlation function; microphone array; multimodal distribution; multiple speaker localization; numerical optimization algorithm; probabilistic framework; speaker locations; time difference of arrival; Acoustics; Arrays; Joints; Microphones; Position measurement; Probabilistic logic; Speech; Gaussian mixture; Microphone arrays; localization; multiple speakers; steered response power;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6638402
  • Filename
    6638402