Title :
Automatic speaker clustering from multi-speaker utterances
Author :
McLaughlin, Jack ; Reynolds, Douglas ; Singer, Elliot ; O´Leary, Gerald C.
Author_Institution :
Lincoln Lab., MIT, Lexington, MA, USA
Abstract :
Blind clustering of multi-person utterances by speaker is complicated by the fact that each utterance has at least two talkers. In the case of a two-person conversation, one can simply split each conversation into its respective speaker halves, but this introduces error which ultimately hurts clustering. We propose a clustering algorithm which is capable of associating each conversation with two clusters (and therefore two-speakers) obviating the need for splitting. Results are given for two speaker conversations culled from the Switchboard corpus, and comparisons are made to results obtained on single-speaker utterances. We conclude that although the approach is promising, our technique for computing inter-conversation similarities prior to clustering needs improvement
Keywords :
pattern clustering; speech recognition; automatic speaker clustering; blind clustering; clustering algorithm; inter-conversation similarities; multi-person utterances; multi-speaker utterances; two-person conversation; Cepstral analysis; Clustering algorithms; Clustering methods; Laboratories; Lifting equipment; Natural languages; Speech; Telephony; Tree data structures;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5041-3
DOI :
10.1109/ICASSP.1999.759796