DocumentCode :
3245442
Title :
A study of generic models for unsupervised on-line speaker indexing
Author :
Kwon, Soonil ; Narayanan, Shrikanth
Author_Institution :
Speech Anal. & Interpretation Lab., Univ. of Southern California, CA, USA
fYear :
2003
fDate :
30 Nov.-3 Dec. 2003
Firstpage :
423
Lastpage :
428
Abstract :
On-line speaker indexing sequentially detects the points where a speaker identity changes in a multi-speaker audio stream, and classifies each speaker segment. The paper addresses two challenges. The first relates to monitoring, which requires on-line processing. The second relates to the fact that the number/identity of the speakers is unknown. The indexing needs to be made in an unsupervised process. To address these issues, we apply a predetermined generic speaker-independent model set, sample speaker model (SSM). This set can be useful for more accurate speaker modeling and clustering without requiring training models on target speaker data. Once a speaker-independent model is selected from the sample models, it is adapted into a speaker-dependent model progressively. Experiments were performed with the speaker recognition benchmark NIST Speech (1999). Results showed that our new technique, simulated using the Markov chain Monte Carlo method, gave 92.47% indexing accuracy on telephone conversation data.
Keywords :
pattern classification; pattern clustering; speaker recognition; speech processing; Markov chain Monte Carlo method; clustering; generic models; multi-speaker audio stream; sample speaker model; speaker recognition benchmark; speaker segment classification; telephone conversation data; unsupervised on-line speaker indexing; Indexing; Monitoring; NIST; Speaker recognition; Speech analysis; Speech processing; Streaming media; Teleconferencing; Telephony; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
Print_ISBN :
0-7803-7980-2
Type :
conf
DOI :
10.1109/ASRU.2003.1318478
Filename :
1318478
Link To Document :
بازگشت