Title :
Large-scale community detection on speaker content graphs
Author :
Shum, Stephen H. ; Campbell, W.M. ; Reynolds, Douglas A.
Author_Institution :
MIT CSAIL, Cambridge, MA, USA
Abstract :
We consider the use of community detection algorithms to perform speaker clustering on content graphs built from large audio corpora. We survey the application of agglomerative hierarchical clustering, modularity optimization methods, and spectral clustering as well as two random walk algorithms: Markov clustering and Infomap. Our results on graphs built from the NIST 2005+2006 and 2008+2010 Speaker Recognition Evaluations (SREs) provide insight into both the structure of the speakers present in the data and the intricacies of the clustering methods. In particular, we introduce an additional parameter to Infomap that improves its clustering performance on all graphs. Lastly, we also develop an automatic technique to purify the neighbors of each node by pruning away unnecessary edges.
Keywords :
optimisation; speaker recognition; Infomap; Markov clustering; SRE; agglomerative hierarchical clustering; automatic technique; clustering methods; community detection algorithms; large audio corpora; large-scale community detection; modularity optimization methods; random walk algorithms; speaker clustering; speaker content graphs; speaker recognition evaluations; spectral clustering; Algorithm design and analysis; Clustering algorithms; Communities; Matrix converters; NIST; Speaker recognition; Speech; community detection; modularity optimization; random walk algorithms; speaker clustering; spectral clustering;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639165