DocumentCode :
755682
Title :
Large population speaker identification using clean and telephone speech
Author :
Reynolds, Douglas A.
Author_Institution :
Lincoln Lab., MIT, Cambridge, MA, USA
Volume :
2
Issue :
3
fYear :
1995
fDate :
3/1/1995
Firstpage :
46
Lastpage :
48
Abstract :
This paper presents text-independent speaker identification results for varying speaker population sizes up to 630 speakers for both clean wideband speech and telephone speech. A system based on Gaussian mixture speaker models is used for speaker identification, and experiments are conducted on the TIMIT and NTIMIT databases. The TIMIT results show large population performance under near-ideal conditions, and the NTIMIT results show the corresponding accuracy loss due to telephone transmission. These are believed to be the first speaker identification experiments on the complete 630 speaker TIMIT and NTIMIT databases and the largest text-independent speaker identification task reported to date. Identification accuracies of 99.5% and 60.7% were achieved on the TIMIT and NTIMIT databases, respectively.
Keywords :
Gaussian processes; speaker recognition; telephony; Gaussian mixture speaker models; NTIMIT database; TIMIT database; clean speech; large population speaker identification; near-ideal conditions; telephone speech; telephone transmission; text-independent speaker identification; wideband speech; Additive noise; Databases; Degradation; Loudspeakers; Performance loss; Propagation losses; Speech analysis; Telephony; Wideband;
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
IEEE
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/97.372913
Filename :
372913