• DocumentCode
    3849572
  • Title

    Online speaker segmentation and clustering using cross-likelihood ratio calculation with reference criterion selection

  • Author

    M. Grasic;M. Kos;Z. Kacic

  • Author_Institution
    Faculty of Electrical Engineering and Computer Science, University of Maribor, Maribor, Slovenia
  • Volume
    4
  • Issue
    6
  • fYear
    2010
  • Firstpage
    673
  • Lastpage
    685
  • Abstract
    This study presents a new online method for speaker segmentation and clustering in real-world environments. It analyses and discusses the difficulties of online speaker diarisation and proposes a new segmentation and clustering method, in which the Bayesian information criterion (BIC) and the normalised cross-likelihood ratio (NCLR) are combined into an online speaker diarisation system. A new decision parameter for BIC and NCLR is proposed using normalisation with reference criterion selection (NRCS), together with a window normalisation technique called window-length compensation (WLC), which normalises the criterion value according to analysed window length. The effectiveness of the proposed system and techniques in comparison to the standard offline speaker diarisation system (mClust) is demonstrated on the Slovenian Broadcast News database (BNSI) and an English Broadcast News database (the HUB-4). The online system presented in this study achieves similar performance to the BIC-based offline approach.
  • Journal_Title
    IET Signal Processing
  • Publisher
    iet
  • ISSN
    1751-9675
  • Type

    jour

  • DOI
    10.1049/iet-spr.2009.0235
  • Filename
    5665899