• DocumentCode
    394309
  • Title

    Online speaker clustering

  • Author

    Liu, Daben ; Kubala, Francis

  • Author_Institution
    BBN Technol., Cambridge, MA, USA
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    This paper describes a set of new algorithms that perform speaker clustering in an online fashion. Unlike typical clustering approaches, the proposed method does not require the presence of all the data before performing clustering. The clustering decision is made as soon as an audio segment is received. Being causal, this method enables low-latency incremental speaker adaptation in online speech-to-text systems. It also gives a speaker tracking and indexing system the ability to label speakers with cluster ID on the fly. We show that the new online speaker clustering method yields better performance compared to the traditional hierarchical speaker clustering. Evaluation metrics for speaker clustering are also discussed.
  • Keywords
    pattern clustering; speech recognition; speech synthesis; audio segment; cluster ID; clustering decision; low-latency incremental speaker adaptation; online speaker clustering; online speech-to-text systems; speaker clustering algorithms; speaker indexing system; speaker tracking system; speech segments identification; Clustering algorithms; Clustering methods; Computational complexity; Gaussian distribution; Indexing; Labeling; Organizing; Speech recognition; Time factors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198845
  • Filename
    1198845