DocumentCode
1692134
Title
Efficient iterative mean shift based cosine dissimilarity for multi-recording speaker clustering
Author
Senoussaoui, Mohammed ; Kenny, P. ; Dumouchel, P. ; Stafylakis, Themos
Author_Institution
Ecole de Technol. Super. (ETS), Montréal, QC, Canada
fYear
2013
Firstpage
7712
Lastpage
7715
Abstract
Speaker clustering is an important task in many applications such as Speaker Diarization as well as Speech Recognition. Speaker clustering can be done within a single multi-speaker recording (Diarization) or for a set of different recordings. In this work we are interested by the former case and we propose a simple iterative Mean Shift (MS) algorithm to deal with this problem. Traditionally, MS algorithm is based on Euclidean distance. We propose to use the Cosine distance in order to build a new version of MS algorithm. We report results as measured by speaker and cluster impurities on NIST SRE 2008 datasets.
Keywords
audio recording; iterative methods; speaker recognition; Euclidean distance; MS algorithm; NIST SRE 2008 datasets; cluster impurities; cosine distance; iterative mean shift based cosine dissimilarity; multirecording speaker clustering; single multispeaker recording; speaker diarization; speaker impurities; speech recognition; Clustering algorithms; Euclidean distance; Impurities; Integrated circuits; Kernel; Speaker recognition; Speech; Cluster Impurity; Cosine distance; Mean Shift (MS); Speaker Clustering; Speaker Impurity;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6639164
Filename
6639164
Link To Document