DocumentCode
394309
Title
Online speaker clustering
Author
Liu, Daben ; Kubala, Francis
Author_Institution
BBN Technol., Cambridge, MA, USA
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
This paper describes a set of new algorithms that perform speaker clustering in an online fashion. Unlike typical clustering approaches, the proposed method does not require the presence of all the data before performing clustering. The clustering decision is made as soon as an audio segment is received. Being causal, this method enables low-latency incremental speaker adaptation in online speech-to-text systems. It also gives a speaker tracking and indexing system the ability to label speakers with cluster ID on the fly. We show that the new online speaker clustering method yields better performance compared to the traditional hierarchical speaker clustering. Evaluation metrics for speaker clustering are also discussed.
Keywords
pattern clustering; speech recognition; speech synthesis; audio segment; cluster ID; clustering decision; low-latency incremental speaker adaptation; online speaker clustering; online speech-to-text systems; speaker clustering algorithms; speaker indexing system; speaker tracking system; speech segments identification; Clustering algorithms; Clustering methods; Computational complexity; Gaussian distribution; Indexing; Labeling; Organizing; Speech recognition; Time factors;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198845
Filename
1198845
Link To Document