DocumentCode :
2697010
Title :
Hybrid Speech/non-speech detector applied to Speaker Diarization of Meetings
Author :
Anguera, Xavier ; Aguilo, Mateu ; Wooters, Chuck ; Nadeu, Climent ; Hernando, Javier
Author_Institution :
Int. Comput. Sci. Inst., Berkeley, CA
fYear :
2006
fDate :
28-30 June 2006
Firstpage :
1
Lastpage :
6
Abstract :
When performing speaker diarization, it is common practice to use an agglomerative clustering approach where the acoustic data is first split into small segments and then pairs of these segments are merged until a particular stopping point is reached. The diarization performance can be greatly improved by the use of a speech/non-speech detector. The use of a speech/non-speech detector helps the diarization system by preventing non-speech frames from "confusing" both the merging and the stopping processes. Over the years there has been extensive research on speech/non-speech detectors. Often, speech/non-speech detectors require training data and their accuracy is strongly dependent on setting various thresholds correctly. In this work we present a hybrid speech/non-speech detector for use in our speaker diarization system within the meetings domain. Our proposed speech/non-speech system runs in two stages. The first stage performs an energy-based detection. The second stage performs a model-based decoding using the previous stage's data as a bootstrap for the acoustic models, thus avoiding the need for any outside training data. We show an improvement of 14% and 10% relative on a development and test set
Keywords :
acoustic signal detection; decoding; speaker recognition; acoustic model; agglomerative clustering approach; hybrid speech-nonspeech detector; meeting speaker diarization; model-based decoding; Acoustic signal detection; Acoustic testing; Audio recording; Computer science; Decoding; Detectors; Hidden Markov models; Loudspeakers; Speech processing; Training data;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
IEEE Odyssey 2006: The Speaker and Language Recognition Workshop
Conference_Location :
San Juan
Print_ISBN :
1-4244-0471-1
Electronic_ISBN :
1-4244-0472-X
Type :
conf
DOI :
10.1109/ODYSSEY.2006.248109
Filename :
4013526