DocumentCode
1692102
Title
Impact of overlapping speech detection on speaker diarization for broadcast news and debates
Author
Charlet, D. ; Barras, Claude ; Lienard, Jean-Sylvain
Author_Institution
Orange Labs., France Telecom, Lannion, France
fYear
2013
Firstpage
7707
Lastpage
7711
Abstract
The overlapping speech detection systems developed by Orange and LIMSI for the ETAPE evaluation campaign on French broadcast news and debates are described. Using either cepstral features or a multi-pitch analysis, an F1-measure for overlapping speech detection of up to 59.2% is reported on the TV data of the ETAPE evaluation set, where 6.7% of the speech was measured as overlapping, ranging from 1.2% in the news to 10.4% in the debates. Overlapping speech segments were excluded during the speaker diarization stage, and these segments were subsequently labelled with the two nearest speaker labels, taking into account the temporal distance. We describe the effects of this strategy for various overlapping speech systems and show that it improves the diarization error rate in all situations, by up to 26.1% relative in our best configuration.
Keywords
cepstral analysis; speech recognition; ETAPE evaluation campaign; ETAPE evaluation set; F1-measure; French broadcast news; cepstral features; debates; diarization error rate; multipitch analysis; nearest speaker labels; overlapping speech detection systems; overlapping speech segments; overlapping speech systems; speaker diarization stage; speech measurement; temporal distance; Abstracts; Acoustics; Density estimation robust algorithm; Speech; Speech recognition; overlapping speech; speaker diarization
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6639163
Filename
6639163