Title :
Comparison of text-independent speaker recognition methods on telephone speech with acoustic mismatch
Author :
Van Vuuren, Sarel
Author_Institution :
Center for Spoken Language Understanding, Oregon Graduate Inst. of Sci. & Technol., Beaverton, OR, USA
Abstract :
We compare speaker recognition performance of vector quantization (VQ), Gaussian mixture modeling (GMM) and the Arithmetic Harmonic Sphericity measure (AHS) in adverse telephone speech conditions. The aim is to address the question: how do multimodal VQ and GMM typically compare to the simpler unimodal AHS for matched and mismatched training and testing environments? We study identification (closed set) and verification errors on a new multi environment database. We consider LPC and PLP features as well as their RASTA derivatives. We conclude that RASTA processing can remove redundancies from the features. We affirm that even when we use channel and noise compensation schemes, speaker recognition errors remain high when there is acoustic mismatch
Keywords :
Gaussian processes; linear predictive coding; multimedia computing; speaker recognition; speech processing; telecommunication computing; telephony; vector quantisation; Arithmetic Harmonic Sphericity measure; GMM; Gaussian mixture modeling; RASTA derivatives; RASTA processing; acoustic mismatch; adverse telephone speech conditions; multi environment database; multimodal VQ; noise compensation schemes; speaker recognition errors; speaker recognition performance; telephone speech; text independent speaker recognition methods; unimodal AHS; vector quantization; verification errors; Arithmetic; Automated highways; Linear predictive coding; Redundancy; Spatial databases; Speaker recognition; Speech; Telephony; Testing; Vector quantization;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607976