• DocumentCode
    177422
  • Title

    Variational Bayes based I-vector for speaker diarization of telephone conversations

  • Author

    Rong Zheng ; Ce Zhang ; Shanshan Zhang ; Bo Xu

  • Author_Institution
    Interactive Digital Media Technol. Res. Center, Inst. of Autom., Beijing, China
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    91
  • Lastpage
    95
  • Abstract
    In this paper, we investigate the variational Bayes based I-vector method for speaker diarization of telephone conversations. The motivation of the proposed algorithm is to utilize variational Bayesian framework and exploit potential channel effect of total variability modeling for diarization of conversation side. Other three well-known techniques are compared as follows: K-means clustering for eigenvoices and I-vector speaker diarization, and variational Bayes applied to eigenvoices. Performance evaluations are conducted on the summed-channel telephone data from the 2008 NIST speaker recognition evaluation. The paper discusses how the performance is influenced by different modules, e.g., VAD, initial speaker clustering and Viterbi re-segmentation. Comparison experiments show the interest of variational Bayesian probabilistic framework for speaker diarization.
  • Keywords
    Bayes methods; Viterbi detection; eigenvalues and eigenfunctions; pattern clustering; speaker recognition; telephone sets; variational techniques; VAD; Viterbi resegmentation; eigenvoices; performance evaluation; potential channel effect; speaker clustering; speaker diarization; summed-channel telephone; telephone conversations; total variability modeling; variational Bayes based I-vector method; Bayes methods; Clustering algorithms; Density estimation robust algorithm; Speech; Speech processing; Vectors; Viterbi algorithm; I-vector; eigenvoices; speaker diarization; total variability; variational Bayes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6853564
  • Filename
    6853564