DocumentCode :
417374
Title :
Binaural sound segregation for multisource reverberant environments
Author :
Roman, Nicoleta ; Wang, DeLiang
Author_Institution :
Dept. of Comput. & Inf. Sci., Ohio State Univ., Columbus, OH, USA
Volume :
2
fYear :
2004
fDate :
17-21 May 2004
Abstract :
We present a novel method for binaural sound segregation from acoustic mixtures contaminated by both multiple interference and reverberation. We employ the notion of an ideal time-frequency binary mask, which selects the target if it is stronger than the interference in a local time-frequency (T-F) unit. As opposed to classical adaptive filtering, which focuses on the suppression of noise, our model employs an adaptive filter that performs target cancellation. T-F units dominated by a target are largely suppressed at the output of the cancellation unit when compared to units dominated by noise. Consequently, the actual input-to-output attenuation level in each T-F unit is used to estimate an ideal binary mask. A systematic evaluation in terms of automatic speech recognition performance shows that the resulting system produces masks close to ideal binary ones.
Keywords :
acoustic signal processing; adaptive filters; interference suppression; reverberation; source separation; speech processing; time-frequency analysis; acoustic mixtures; adaptive filter; automatic speech recognition performance; binaural sound segregation; multiple interference; multisource reverberant environments; reverberation; target cancellation; time-frequency binary mask; time-frequency unit; Acoustic noise; Adaptive filters; Attenuation; Automatic speech recognition; Degradation; Delay effects; Interference; Noise cancellation; Reverberation; Time frequency analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1326272
Filename :
1326272
Link To Document :
بازگشت