Title :
Speech overlap detection using convolutive non-negative sparse coding: New improvements and insights
Author :
Geiger, J?¼rgen T. ; Vipperla, Ravichander ; Evans, Nicholas ; Schuller, Bj?¶rn ; Rigoll, Gerhard
Author_Institution :
Inst. for Human-Machine Commun., Tech. Univ. Munchen, Munich, Germany
Abstract :
This paper presents recent advances in the application of convolutive non-negative sparse coding (CNSC) to the problem of overlap detection in the context of conference meetings and speaker diarization. CNSC is used to project a mixed speaker signal onto separate speaker bases and hence to detect intervals of competing speech. We present new energy ratio and total energy features which give significant improvements over our previous work. The system is assessed using a subset of the AMI meeting corpus. We report results which are comparable to the state of the art which support the potential of a new approach to overlap detection. An analysis of system performance highlights the importance of further work to addresses weaknesses in detecting particularly short segments of overlapping speech.
Keywords :
speaker recognition; speech coding; AMI; CNSC; convolutive nonnegative sparse coding; overlapping speech; speaker diarization; speaker signal; speech overlap detection; Density estimation robust algorithm; Feature extraction; Hidden Markov models; Sparse matrices; Speech; Speech coding; convolutive non-negative sparse coding; speaker diarization; speech overlap detection;
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
Print_ISBN :
978-1-4673-1068-0