• DocumentCode
    3423849
  • Title

    Combination of agglomerative and sequential clustering for speaker diarization

  • Author

    Vijayasenan, Deepu ; Valente, Fabio ; Bourlard, Hervé

  • Author_Institution
    IDIAP Res. Inst., Martigny
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4361
  • Lastpage
    4364
  • Abstract
    This paper aims at investigating the use of sequential clustering for speaker diarization. Conventional diarization systems are based on parametric models and agglomerative clustering. In our previous work we proposed a non-parametric method based on the agglomerative information bottleneck for very fast diarization. Here we consider the combination of sequential and agglomerative clustering for avoiding local maxima of the objective function and for purification. Experiments are run on the RT06 eval data. Sequential Clustering with oracle model selection can reduce the speaker error by 10% w.r.t. agglomerative clustering. When the model selection is based on Normalized Mutual Information criterion, a relative improvement of 5% is obtained using a combination of agglomerative and sequential clustering.
  • Keywords
    speaker recognition; speech synthesis; agglomerative clustering; normalized mutual information criterion; oracle model selection; sequential clustering; speaker diarization; speaker error; Bayesian methods; Clustering algorithms; Hidden Markov models; Mutual information; Parameter estimation; Parametric statistics; Partitioning algorithms; Purification; Speech; Streaming media; Meetings data; Speaker Diarization; agglomerative and sequential information bottleneck;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518621
  • Filename
    4518621