DocumentCode
2178473
Title
Overlapped speech detection using long-term spectro-temporal similarity in stereo recording
Author
Xiao, Bo ; Ghosh, Prasanta Kumar ; Georgiou, Panayiotis ; Narayanan, Shrikanth S.
Author_Institution
Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA
fYear
2011
fDate
22-27 May 2011
Firstpage
5216
Lastpage
5219
Abstract
The problem of detecting overlapped speech in stereo recordings using close-talk microphones is important for a variety of applications including the identification of back-channels, interruptions etc. in a dyadic or multi-party interactions. For detecting overlapped speech, we propose a feature derived using the spectral similarity of two channels over a range of acoustic frames. During overlapped speech frames the proposed spectro-temporal similarity-based feature values decrease and during non-overlapped speech frames the feature values increase due to the presence of cross-talk. Thus the proposed feature helps to discriminate the overlapped speech frames from the non-overlapped ones. Using overlapped speech detection experiments on a dyadic interaction corpus, it is shown that the proposed feature provides a significant improvement ~26% absolute, in the accuracy of detecting the overlapped speech frames when used as an additional feature to the baseline feature obtained from the two channels´ intensity profiles.
Keywords
speech processing; close-talk microphones; long-term spectro-temporal similarity; speech detection; stereo recording; Accuracy; Correlation; Erbium; Speech; Strontium; Tin; Training; correlation coefficient; overlapped speech; spectrogram; stereo recording;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947533
Filename
5947533
Link To Document