Title :
Online speaker clustering
Author :
Lilt, D. ; Kubala, Francis
Author_Institution :
BBN Technol., Cambridge, MA, USA
Abstract :
The paper describes a set of new algorithms that perform speaker clustering in an online fashion. Unlike typical clustering approaches, the proposed method does not require the presence of all the data before performing clustering. The clustering decision is made as soon as an audio segment is received. Being causal, this method enables low-latency incremental speaker adaptation in online speech-to-text systems. It also gives a speaker tracking and indexing system the ability to label speakers with cluster ID on the fly. We show that the new online speaker clustering method yields better performance compared to the traditional hierarchical speaker clustering. Evaluation metrics for speaker clustering are also discussed.
Keywords :
speaker recognition; audio indexing; audio segment; online speaker clustering; online speech-to-text systems; speaker adaptation; speaker indexing; speaker tracking; speech recognition; Clustering algorithms; Clustering methods; Computational complexity; Gaussian distribution; Indexing; Labeling; Organizing; Speech recognition; Time factors;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1325990