Title :
An integration of source location cues for speech clustering in distributed microphone arrays
Author :
Souden, Mehrez ; Kinoshita, Keizo ; Nakatani, Takeshi
Author_Institution :
NTT Commun. Sci. Labs., Kyoto, Japan
Abstract :
We propose a new approach for clustering competing speech sources using distributed microphone arrays. In this approach, we first define two feature vectors where the first captures the intra-node location information while the second captures the level difference of speech energy recorded at different nodes. Then, we introduce Watson and Dirichlet mixture models to model the first and second features, respectively. We integrate both types of information in an expectation maximization algorithm to cluster the simultaneous speech sources. The performance of the proposed approach is superior to best node selection and comparable to centralized processing in terms of conventional blind source separation metrics.
Keywords :
blind source separation; expectation-maximisation algorithm; microphone arrays; Dirichlet mixture model; Watson mixture model; blind source separation metrics; distributed microphone arrays; expectation maximization algorithm; feature vectors; intranode location information; level difference; source location cues; speech clustering; speech energy; speech sources; Algorithm design and analysis; Blind source separation; Clustering algorithms; Microphone arrays; Speech; Vectors; Distributed microphone array; blind source separation; expectation maximization; source clustering;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6637619