• DocumentCode
    1650310
  • Title

    An integration of source location cues for speech clustering in distributed microphone arrays

  • Author

    Souden, Mehrez ; Kinoshita, Keizo ; Nakatani, Takeshi

  • Author_Institution
    NTT Commun. Sci. Labs., Kyoto, Japan
  • fYear
    2013
  • Firstpage
    111
  • Lastpage
    115
  • Abstract
    We propose a new approach for clustering competing speech sources using distributed microphone arrays. In this approach, we first define two feature vectors where the first captures the intra-node location information while the second captures the level difference of speech energy recorded at different nodes. Then, we introduce Watson and Dirichlet mixture models to model the first and second features, respectively. We integrate both types of information in an expectation maximization algorithm to cluster the simultaneous speech sources. The performance of the proposed approach is superior to best node selection and comparable to centralized processing in terms of conventional blind source separation metrics.
  • Keywords
    blind source separation; expectation-maximisation algorithm; microphone arrays; Dirichlet mixture model; Watson mixture model; blind source separation metrics; distributed microphone arrays; expectation maximization algorithm; feature vectors; intranode location information; level difference; source location cues; speech clustering; speech energy; speech sources; Algorithm design and analysis; Blind source separation; Clustering algorithms; Microphone arrays; Speech; Vectors; Distributed microphone array; blind source separation; expectation maximization; source clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6637619
  • Filename
    6637619