DocumentCode :
3568355
Title :
Speech overlap detection using convolutive non-negative sparse coding: New improvements and insights
Author :
Geiger, J?¼rgen T. ; Vipperla, Ravichander ; Evans, Nicholas ; Schuller, Bj?¶rn ; Rigoll, Gerhard
Author_Institution :
Inst. for Human-Machine Commun., Tech. Univ. Munchen, Munich, Germany
fYear :
2012
Firstpage :
340
Lastpage :
344
Abstract :
This paper presents recent advances in the application of convolutive non-negative sparse coding (CNSC) to the problem of overlap detection in the context of conference meetings and speaker diarization. CNSC is used to project a mixed speaker signal onto separate speaker bases and hence to detect intervals of competing speech. We present new energy ratio and total energy features which give significant improvements over our previous work. The system is assessed using a subset of the AMI meeting corpus. We report results which are comparable to the state of the art which support the potential of a new approach to overlap detection. An analysis of system performance highlights the importance of further work to addresses weaknesses in detecting particularly short segments of overlapping speech.
Keywords :
speaker recognition; speech coding; AMI; CNSC; convolutive nonnegative sparse coding; overlapping speech; speaker diarization; speaker signal; speech overlap detection; Density estimation robust algorithm; Feature extraction; Hidden Markov models; Sparse matrices; Speech; Speech coding; convolutive non-negative sparse coding; speaker diarization; speech overlap detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
ISSN :
2219-5491
Print_ISBN :
978-1-4673-1068-0
Type :
conf
Filename :
6333888
Link To Document :
بازگشت