DocumentCode
3716201
Title
Improved binary key speaker diarization system
Author
Héctor Delgado;Xavier Anguera;Corinne Fredouille;Javier Serrano
Author_Institution
CAIAC, Autonomous University of Barcelona, Cerdanyola del Vallè
fYear
2015
Firstpage
2087
Lastpage
2091
Abstract
The recently proposed speaker diarization technique based on binary keys provides a very fast alternative to state-of-the-art systems. However, this speed-up comes at the cost of a slight increase in Diarization Error Rate (DER). This paper proposes a series of improvements to the original algorithm with the aim of getting closer to state-of-the-art performance. First, several alternative similarity measures between binary key speaker/segment models are introduced. Second, we make a first attempt at applying Intra-Session and Intra-Speaker Variability (ISISV) compensation within the binary diarization approach through Nuisance Attribute Projection. Experimental results show the benefits of the newly introduced similarity metrics, as well as the potential of Nuisance Attribute Projection for ISISV compensation in the binary key speaker diarization framework.
Keywords
"Speech","Measurement","Speaker recognition","Acoustics","Eigenvalues and eigenfunctions","Europe","Signal processing"
Publisher
IEEE
Conference_Titel
Signal Processing Conference (EUSIPCO), 2015 23rd European
Electronic_ISBN
2076-1465
Type
conf
DOI
10.1109/EUSIPCO.2015.7362752
Filename
7362752