Title :
Overlapped speech detection using long-term spectro-temporal similarity in stereo recording
Author :
Xiao, Bo ; Ghosh, Prasanta Kumar ; Georgiou, Panayiotis ; Narayanan, Shrikanth S.
Author_Institution :
Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA
Abstract :
Detecting overlapped speech in stereo recordings made with close-talk microphones is important for a variety of applications, including the identification of back-channels, interruptions, etc., in dyadic or multi-party interactions. For detecting overlapped speech, we propose a feature derived from the spectral similarity of the two channels over a range of acoustic frames. During overlapped speech frames, the values of the proposed spectro-temporal similarity-based feature decrease; during non-overlapped speech frames, the feature values increase due to the presence of cross-talk. The proposed feature thus helps to discriminate overlapped speech frames from non-overlapped ones. Overlapped speech detection experiments on a dyadic interaction corpus show that the proposed feature provides a significant improvement, ~26% absolute, in the accuracy of detecting overlapped speech frames when used as an additional feature alongside the baseline feature obtained from the two channels' intensity profiles.
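The abstract does not give the exact definition of the feature, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes the similarity is a Pearson correlation coefficient between the two channels' spectrogram patches, taken over a symmetric window of frames. The function name `long_term_similarity`, the `context` parameter, and the synthetic signals are all hypothetical.

```python
import numpy as np


def long_term_similarity(spec_a, spec_b, context=5):
    """Per-frame correlation between two spectrogram patches.

    spec_a, spec_b: arrays of shape (n_frames, n_bins), one per channel.
    context: number of frames taken on each side of the current frame
             (an assumed value, not taken from the paper).
    """
    spec_a = np.asarray(spec_a, dtype=float)
    spec_b = np.asarray(spec_b, dtype=float)
    if spec_a.shape != spec_b.shape:
        raise ValueError("both channels must have the same shape")

    n_frames = spec_a.shape[0]
    sim = np.zeros(n_frames)
    for t in range(n_frames):
        # Window of frames around t, clipped at the recording edges.
        lo = max(0, t - context)
        hi = min(n_frames, t + context + 1)
        a = spec_a[lo:hi].ravel()
        b = spec_b[lo:hi].ravel()
        a = a - a.mean()
        b = b - b.mean()
        denom = np.sqrt((a @ a) * (b @ b))
        # A constant patch has no defined correlation; report 0 instead.
        sim[t] = (a @ b) / denom if denom > 0 else 0.0
    return sim


# Synthetic illustration only: random arrays stand in for spectrograms.
rng = np.random.default_rng(0)
speaker_a = rng.standard_normal((200, 64))
speaker_b = rng.standard_normal((200, 64))
noise = rng.standard_normal((200, 64))

# Single talker: channel 2 mostly carries attenuated cross-talk from speaker A.
single = long_term_similarity(speaker_a, 0.3 * speaker_a + 0.05 * noise)
# Overlap: each channel carries its own talker plus leakage from the other.
overlap = long_term_similarity(speaker_a + 0.3 * speaker_b,
                               speaker_b + 0.3 * speaker_a)
print(single.mean(), overlap.mean())
```

In this toy setup the single-talker case yields the higher similarity, which matches the behaviour described in the abstract; real recordings would need actual spectrograms and a tuned window length.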
Keywords :
speech processing; close-talk microphones; long-term spectro-temporal similarity; speech detection; stereo recording; Accuracy; Correlation; Erbium; Speech; Strontium; Tin; Training; correlation coefficient; overlapped speech; spectrogram
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
ISSN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947533