Title :
Improved overlap speech diarization of meeting recordings using long-term conversational features
Author :
Yella, Sree Harsha ; Bourlard, Herve
Author_Institution :
Idiap Res. Inst., Martigny, Switzerland
Abstract :
Overlapping speech is a source of significant errors in speaker diarization of spontaneous meeting recordings. Recent works on speaker diarization have attempted to solve the problem of overlap detection using classifiers trained on acoustic and spatial features. This paper proposes a method to improve the short-term spectral feature based overlap detector by incorporating information from long-term conversational features in the form of speaker change statistics. The statistics are obtained at segment level(around few seconds) from the output of a diarization system. The approach is motivated by the observation that segments containing more speaker changes are more probable to have more overlaps. Experiments on AMI meeting corpus reveal that the number of overlaps in a segment follows a Poisson distribution whose rate is directly proportional to the number of speaker changes in the segment. When this information is combined with acoustic information in an HMM/GMM overlap detector, improvements are verified in terms of F-measure and consequently, diarization error (DER) is reduced by 5% relative to the baseline overlap detector.
Keywords :
Gaussian processes; Poisson distribution; hidden Markov models; speech processing; statistical analysis; DER; F-measure; HMM-GMM overlap detector; Poisson distribution; acoustic features; diarization error; long-term conversational features; meeting recordings; overlap detection; overlap speech diarization; short-term spectral feature; spatial features; speaker change statistics; speaker diarization; Acoustics; Detectors; Entropy; Feature extraction; Labeling; Probability; Speech; meetings; speaker diarization; spontaneous conversations; spontaneous overlapping speech;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639171