DocumentCode
481002
Title
Segmental K-Means initialization for SOM-based speaker clustering
Author
Ben-Harush, Oshry ; Lapidot, Itshak ; Guterman, Hugo
Author_Institution
Dept. of Electr. & Comput. Eng., Ben-Gurion Univ. of the Negev, Beer-Sheva
Volume
1
fYear
2008
fDate
10-12 Sept. 2008
Firstpage
305
Lastpage
308
Abstract
A new approach for initial assignment of data in a speaker clustering application is presented. This approach employs segmental k-means clustering algorithm prior to competitive based learning. The clustering system relies on self-organizing maps (SOM) for speaker modeling and as a likelihood estimator. Performance is evaluated on 108 two speaker conversations taken from LDC CALLHOME American English Speech corpus using NIST criterion and shows an improvement of 20%-30% in cluster error rate (CER) relative to the randomly initialized clustering system. The number of iterations was reduced significantly, which contributes to both speed and efficiency of the clustering system.
Keywords
pattern clustering; self-organising feature maps; speaker recognition; cluster error rate; segmental k-means clustering algorithm; self-organizing maps; speaker clustering; Clustering algorithms; Data engineering; Educational institutions; Error analysis; Iterative algorithms; NIST; Neurons; Pattern recognition; Self organizing feature maps; Speech analysis; Clustering; Initial Conditions; K-means; SOM; Speech;
fLanguage
English
Publisher
ieee
Conference_Titel
ELMAR, 2008. 50th International Symposium
Conference_Location
Zadar
ISSN
1334-2630
Print_ISBN
978-1-4244-3364-3
Type
conf
Filename
4747495
Link To Document