DocumentCode
2368497
Title
Efficient speaker verification system using speaker model clustering for T and Z normalizations
Author
Ravulakollu, Kiran ; Apsingekar, Vijendra Raj ; De Leon, Phillip L.
Author_Institution
Klipsch Sch. of Electr. Eng., New Mexico State Univ., Las Cruces, NM
fYear
2008
fDate
13-16 Oct. 2008
Firstpage
56
Lastpage
62
Abstract
In speaker verification (SV) systems based on Gaussian mixture model-universal background model (GMM-UBM), normalization is an important component in the decision stage. Many normalization methods including the T- and Z-norms, have been proposed and investigated and these have contributed to state-of-the-art SV systems which have extremely low equal-error rates (EERs). In this paper, we consider application of both T- and Z-norms to a carefully selected subset of speakers using a data driven approach which can significantly reduce computation resulting in faster SV decisions and lower EER. Unfortunately, selection of the subset is critical and must be representative of the entire speaker model space otherwise error rates will increase. In order to properly select the subset of speakers for the normalizations, we propose a novel method which first clusters the speaker models using the K-means algorithm and the Kullback-Leibler (KL) divergence and then selects a set of speakers within the cluster. We evaluate the approach using both the TIMIT, NTIMIT and NIST-2002 corpora and compare against standard T- and Z-normalizations.
Keywords
Gaussian processes; decision making; error analysis; pattern clustering; speaker recognition; Gaussian mixture model-universal background model; K-means algorithm; Kullback-Leibler divergence; T normalization; Z normalization; equal-error rates; speaker model clustering; speaker model space; speaker verification system; Clustering algorithms; Decision making; Error analysis; Parameter estimation; Support vector machine classification; Support vector machines; Testing; Uncertainty; Weight control; Working environment noise; Clustering methods; Speaker recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Security Technology, 2008. ICCST 2008. 42nd Annual IEEE International Carnahan Conference on
Conference_Location
Prague
Print_ISBN
978-1-4244-1816-9
Electronic_ISBN
978-1-4244-1817-6
Type
conf
DOI
10.1109/CCST.2008.4751277
Filename
4751277
Link To Document