  • DocumentCode
    2809597
  • Title
    SRP-PHAT methods of locating simultaneous multiple talkers using a frame of microphone array data
  • Author
    Do, Hoang; Silverman, Harvey F.
  • Author_Institution
    Div. of Eng., Brown Univ., Providence, RI, USA
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    125
  • Lastpage
    128
  • Abstract
    Two new methods for locating multiple sound sources using a single segment of data from a large-aperture microphone array are presented. Both methods employ the proven-robust steered response power using the phase transform (SRP-PHAT) as a functional. To cluster the data points into highly probable regions containing global peaks, the first method fits a Gaussian mixture model (GMM), whereas the second one sequentially finds the points with highest SRP-PHAT values that most likely represent different clusters. Then the low-cost global optimization method, stochastic region contraction (SRC), is applied to each cluster to find the global peaks. We test the two methods using real data from five simultaneous talkers in a room with high noise and reverberation. Results are presented and discussed.
  • Keywords
    Gaussian processes; acoustic arrays; acoustic signal processing; microphone arrays; optimisation; GMM; Gaussian mixture model; SRC; SRP-PHAT method; global optimization; large-aperture microphone array; phase transform; steered response power; stochastic region contraction; Acoustic noise; Acoustical engineering; Clustering algorithms; Data engineering; Integrated circuit modeling; Microphone arrays; Optimization methods; Position measurement; Power engineering and energy; Reverberation; Acoustic radiators; acoustic arrays; acoustic position measurement; microphones;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2010.5496133
  • Filename
    5496133