Title :
Phonetic speaker recognition
Author :
Kohler, Mary A. ; Andrews, Walter D. ; Campbell, Joseph P. ; Hernández-Cordero, J.
Author_Institution :
Dept. of Defense, MIT, Lexington, MA, USA
Abstract :
This paper introduces a novel language-independent speaker-recognition system based on differences in dynamic realization of phonetic features (i.e., pronunciation) between speakers rather than spectral differences in voice quality. The system exploits phonetic information from six languages to perform text-independent speaker recognition. All experiments were performed on the NIST 2001 Speaker Recognition Evaluation Extended Data Task. Recognition results are provided for unigram, bigram, and trigram models. Performance for each of the three models is examined for phones from each individual language and for the final multilanguage fused system. Additional fusion experiments demonstrate that speaker-recognition capability is maintained even without phonetic information in the language of the speaker.
Keywords :
feature extraction; speaker recognition; speech processing; NIST 2001 Speaker Recognition Evaluation Extended Data Task; bigram model; dynamic realization; language-independent speaker recognition; multilanguage fused system; performance; phonetic features; phonetic speaker recognition; pronunciation; refracted phone sequences; text independent speaker recognition; trigram model; unigram model; Acoustic testing; Automatic speech recognition; Cepstral analysis; Echo cancellers; NIST; Natural languages; Performance evaluation; Speaker recognition; Speech processing; Speech recognition
Conference_Title :
Signals, Systems and Computers, 2001. Conference Record of the Thirty-Fifth Asilomar Conference on
Conference_Location :
Pacific Grove, CA, USA
Print_ISBN :
0-7803-7147-X
DOI :
10.1109/ACSSC.2001.987748