• DocumentCode
    1474536
  • Title

    The Segmental Bayesian Information Criterion and Its Applications to Speaker Diarization

  • Author

    Stafylakis, Themos ; Katsouros, Vassilis ; Carayannis, George

  • Author_Institution
    Sch. of Electr. & Electron. Eng., Nat. Tech. Univ. of Athens, Athens, Greece
  • Volume
    4
  • Issue
    5
  • fYear
    2010
  • Firstpage
    857
  • Lastpage
    866
  • Abstract
    This paper discusses the use of the BIC with respect to speaker diarization, i.e., the problem of assigning the observation vectors of an audio file to a set of speakers of unknown cardinality. Our primary goals are to examine the two dominant approaches of the BIC, namely the global and the local and combine the strengths of the two variants into one intuitive criterion, the segmental-BIC. We then consider the asymptotic behavior of the segmental-BIC, when dealing with models that are highly misspecified, as the ones commonly used in the speaker diarization task. Our main result is a modified version of the BIC, which significantly outperforms the current variants over the entire range of operating points, and achieves performance close to those of highly computationally demanding algorithms.
  • Keywords
    speaker recognition; BIC; asymptotic behavior; computationally demanding algorithms; segmental Bayesian information criterion; speaker diarization; Algorithm design and analysis; Automatic speech recognition; Bayesian methods; Clustering algorithms; High performance computing; Information analysis; Natural languages; Performance analysis; Speech processing; Technological innovation; Bayesian information criterion (BIC); cluster analysis; clustering; speaker diarization (SD);
  • fLanguage
    English
  • Journal_Title
    Selected Topics in Signal Processing, IEEE Journal of
  • Publisher
    ieee
  • ISSN
    1932-4553
  • Type

    jour

  • DOI
    10.1109/JSTSP.2010.2048656
  • Filename
    5451053