DocumentCode
1749690
Title
Very large population text-independent speaker identification using transformation enhanced multi-grained models
Author
Chaudhari, Upendra V. ; Navrratil, J. ; Ramaswamy, Ganesh N. ; Maes, Stéphane H.
Author_Institution
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume
1
fYear
2001
fDate
2001
Firstpage
461
Abstract
Presents results on speaker identification with a population size of over 10000 speakers. Speaker modeling is accomplished via our transformation enhanced multigrained models. Pursuing two goals, the first is to study the performance of a number of different systems within the modeling framework of multi-grained models. The second is to analyze performance as a function of population size. We show that the most complex models within the framework perform the best and demonstrate that, in approximation, the identification error rate scales linearly with the log of the population size for the described system. Further, we develop a candidate rejection technique based on our analysis of the system performance which indicates a low confidence in the identity chosen
Keywords
Gaussian distribution; feature extraction; hidden Markov models; speaker recognition; candidate rejection technique; identification error rate; population size; speaker modeling; telephone-quality speech; transformation enhanced Gaussian mixture model; transformation enhanced multi-grained models; very large population text-independent speaker identification; Cepstral analysis; Error analysis; Mel frequency cepstral coefficient; Microphones; Performance analysis; Speaker recognition; Speech; System performance; Testing; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location
Salt Lake City, UT
ISSN
1520-6149
Print_ISBN
0-7803-7041-4
Type
conf
DOI
10.1109/ICASSP.2001.940867
Filename
940867
Link To Document