DocumentCode :
177428
Title :
Bayesian analysis of similarity matrices for speaker diarization
Author :
Sholokhov, Alexey ; Pekhovsky, Timur ; Kudashev, Oleg ; Shulipa, Andrei ; Kinnunen, Tomi
Author_Institution :
Sch. of Comput., Univ. of Eastern Finland, Kuopio, Finland
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
106
Lastpage :
110
Abstract :
Inspired by recent success of speaker clustering in Total Variability space we propose a new probabilistic model for speaker diarization based on Bayesian modeling of pairwise similarity scores. The recordings are represented by symmetric similarity matrices of likelihood ratio scores from probabilistic linear discriminant analysis (PLDA) trained on short-term i-vectors. We employ Bayesian approach to address the problem of unknown number of speakers in conversation. Diarization error rates on the NIST 2008 SRE telephone data indicate comparable performance with state-of-the-art eigenvoice-based diarization. But unlike the eigenvoice approach, our method finds the number of speakers automatically, making the proposed model more viable for practical applications.
Keywords :
Bayes methods; pattern clustering; speaker recognition; vectors; Bayesian analysis; Bayesian modeling; NIST 2008 SRE telephone data; PLDA; diarization error rates; likelihood ratio scores; pairwise similarity scores; probabilistic linear discriminant analysis; probabilistic model; recording representation; short-term i-vectors; speaker clustering; speaker diarization; symmetric similarity matrices; total variability space; Bayes methods; Computational modeling; Data models; NIST; Probabilistic logic; Speech; Vectors; Speaker diarization; similarity matrix; variational Bayesian inference;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6853567
Filename :
6853567
Link To Document :
بازگشت