DocumentCode
3423831
Title
New implementations of the E-HMM-based system for speaker diarization in meeting rooms
Author
Fredouille, Corinne ; Evans, Nicholas
Author_Institution
LIA, Univ. of Avignon, Avignon
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
4357
Lastpage
4360
Abstract
This paper addresses the problem of speaker diarization in the specific context of meeting room recordings. Some new enhancements to the E-HMM-based speaker diarization system are reported. These involve a different approach to speaker modelling, utilising EM/ML-based training rather than MAP adaptation as in our previous work. Using the new system, we investigate the effects of speech activity detection through speaker diarization experiments conducted on 23 meetings extracted from the NIST/RT evaluation campaign datasets. We propose a new approach, which assigns confidence values according to the type of information carried by the signal and incorporates these values directly into the speaker diarization system. Experimental results show that, perhaps surprisingly, the non-speech segments do not systematically affect the robustness of the speaker diarization system or, more precisely, of the speaker model training process.
Keywords
hidden Markov models; speaker recognition; speech processing; HMM-based speaker diarization system; meeting room recordings; nonspeech segments; speaker diarization system; speaker modelling; speech activity detection; Clustering algorithms; Data mining; Microphones; NIST; Protocols; Robustness; Shape; Speech analysis; Speech enhancement; confidence values; meeting rooms; speaker diarization
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4518620
Filename
4518620