DocumentCode :
2199906
Title :
Real-time source separation based on sound localization in a reverberant environment
Author :
Aoki, Mariko ; Furuya, Ken´ichi
Author_Institution :
NTT Cyber Space Labs., NTT Corp., Tokyo, Japan
fYear :
2002
fDate :
2002
Firstpage :
475
Lastpage :
484
Abstract :
We propose a real-time source separation method that works well even under reverberant conditions. Previously, we proposed a method called SAFIA, which segregates sound sources by using sound localization cues acquired by multiple microphones. Under reverberant conditions, SAFIA suffers from "spectral overlap caused by reverberation", which introduces distortion into the separated speech signals. Extending the concept of SAFIA, we propose a new method (WAFD-SAFIA) based on simple signal-processing operations. WAFD-SAFIA significantly reduces the effects of "spectral overlap caused by reverberation". Computing the SNR (signal-to-noise ratio) and SDR (signal-to-distortion ratio) for both methods, we found that this new method outperformed SAFIA in a realistic environment. Moreover, to clarify the effect of frequency resolution on SAFIA, we determined whether a given frequency resolution decreased the overlap between the frequency components of two speech signals.
Keywords :
acoustic noise; microphones; real-time systems; reverberation; signal resolution; source separation; spectral analysis; speech processing; SAFIA; SNR; WAFD-SAFIA; beam patterns effect; cross-channel signal interference; distortion; frequency components; frequency resolution; multiple microphones; real-time source separation; reverberant conditions; reverberant environment; signal processing; signal-to-distortion ratio; signal-to-noise ratio; sound localization; sound localization cues; sound sources; spectral overlap; speech signals; Acoustic noise; Distortion; Frequency; Microphones; Signal resolution; Signal to noise ratio; Source separation; Speech; Statistics; Transfer functions;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks for Signal Processing, 2002. Proceedings of the 2002 12th IEEE Workshop on
Print_ISBN :
0-7803-7616-1
Type :
conf
DOI :
10.1109/NNSP.2002.1030059
Filename :
1030059
Link To Document :
بازگشت