DocumentCode
2179831
Title
Using clustering comparison measures for speaker recognition
Author
Kua, Jia Min Karen ; Epps, Julien ; Nosratighods, Mohaddeseh ; Ambikairajah, Eliathamby ; Choi, Eric
fYear
2011
fDate
22-27 May 2011
Firstpage
5452
Lastpage
5455
Abstract
Recent results seem to cast some doubt over the assumption that improvements in fused recognition accuracy for speaker recognition systems based on different acoustic features are due mainly to the different origins of the features (e.g. magnitude, phase, modulation information). In this study, we utilize clustering comparison measures to investigate acoustic and speaker modelling aspects of the speaker recognition task separately and demonstrate that front-end diversity can be achieved purely through different ´partitioning´ of the acoustic space. Further, features that exhibit good ´stability´ with respect to repeated clustering are shown to also give good EER performance in speaker recognition. This has implications for feature choice, fusion of systems employing different features, and for UBM data selection. A method for the latter problem is presented that gives up to an 11% relative reduction in EER using only 20-30% of the usual UBM training data set.
Keywords
pattern clustering; speaker recognition; EER; UBM data selection; clustering comparison measures; front-end diversity; fused recognition accuracy; speaker recognition systems; Acoustic measurements; Mel frequency cepstral coefficient; NIST; Speaker recognition; Stability analysis; Training; UBM training; normalised information distance; normalised mutual information; speaker recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947592
Filename
5947592
Link To Document