Title :
An extension to Fisher Linear Semi-Discriminant analysis for Speaker Diarization
Author :
Montazzolli, Sergio ; Adami, Andrea ; Barone, Dante
Author_Institution :
Inst. de Inf./PPGC, UFRGS, Porto Alegre, Brazil
Abstract :
The Fisher Linear Semi-Discriminant Analysis is used in Speaker Diarization to project acoustic features into a discriminant and lower dimensional space. Given that such analysis uses short segments to estimate the scatter matrices, the projection could be improved by using longer segments (i.e., more information). Since a change of speaker is more likely to occur during periods of non-speech, we propose to use segments of speech produced by the boundaries estimated from a voice activity detection method based on Hidden Markov Models. Using datasets from the NIST Speaker Recognition Evaluations, we show that the estimated segments provide a better scatter matrices for the analysis. The results show a relative improvement of 21% in the Speaker Error Time on the Switchboard corpus used in the evaluations.
Keywords :
S-matrix theory; estimation theory; hidden Markov models; speaker recognition; Hidden Markov models; acoustic features projection; fisher linear semidiscriminant analysis; scatter matrices; speaker diarization; speaker error time; speech segments; voice activity detection method; Hidden Markov models; Mel frequency cepstral coefficient; Nickel; Speech; Speech processing; Vectors;
Conference_Titel :
Telecommunications Symposium (ITS), 2014 International
Conference_Location :
Sao Paulo
DOI :
10.1109/ITS.2014.6947969