Title :
Novel inter-cluster distance measure combining GLR and ICR for improved agglomerative hierarchical speaker clustering
Author :
Han, Kyu J. ; Narayanan, Shrikanth S.
Author_Institution :
Ming Hsieh Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA
fDate :
March 31 2008-April 4 2008
Abstract :
Agglomerative hierarchical clustering (AHC) has been a popular strategy for speaker clustering, due to its simple structure but acceptable level of performance. One of the main challenges in AHC that affects clustering performance is how to select the closest cluster pair for merging at every recursion. For this, generalized likelihood ratio (GLR) has been widely adopted as an inter-cluster distance measure. However, it tends to be affected by the size of the clusters considered, which could result in erroneous selection of the cluster pair to be merged during AHC. To tackle this problem, we propose a novel alternative to GLR in this paper, which is a combination of GLR and information change rate (ICR) that we recently introduced for addressing the aforementioned tendency of GLR. Experiments on various meeting speech data show that this combined measure improves clustering performance on average by around 30% (relative).
Keywords :
estimation theory; pattern clustering; speaker recognition; agglomerative hierarchical speaker clustering; generalized likelihood ratio; information change rate; intercluster distance measure; Degradation; Distance measurement; Electric variables measurement; Error analysis; Laboratories; Merging; Size control; Size measurement; Speech analysis; Viterbi algorithm; Speaker clustering; agglomerative hierarchical clustering (AHC); generalized likelihood ratio (GLR); information change rate (ICR);
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518624