Title :
Grouping Separated Frequency Components by Estimating Propagation Model Parameters in Frequency-Domain Blind Source Separation
Author :
Sawada, Hiroshi ; Araki, Shoko ; Mukai, Ryo ; Makino, Shoji
Author_Institution :
NTT Corp., Kyoto
fDate :
7/1/2007 12:00:00 AM
Abstract :
This paper proposes a new formulation and optimization procedure for grouping frequency components in frequency-domain blind source separation (BSS). We adopt two separation techniques, independent component analysis (ICA) and time-frequency (T-F) masking, for the frequency-domain BSS. With ICA, grouping the frequency components corresponds to aligning the permutation ambiguity of the ICA solution in each frequency bin. With T-F masking, grouping the frequency components corresponds to classifying sensor observations in the time-frequency domain for individual sources. The grouping procedure is based on estimating anechoic propagation model parameters by analyzing ICA results or sensor observations. More specifically, the time delays of arrival and attenuations from a source to all sensors are estimated for each source. The focus of this paper includes the applicability of the proposed procedure for a situation with wide sensor spacing where spatial aliasing may occur. Experimental results show that the proposed procedure effectively separates two or three sources with several sensor configurations in a real room, as long as the room reverberation is moderately low.
Keywords :
blind source separation; independent component analysis; time-frequency analysis; anechoic propagation model; frequency component grouping; frequency component separation; frequency-domain blind source separation; independent component analysis; propagation model parameter estimation; time-frequency domain; time-frequency masking; wide sensor spacing; Attenuation; Blind source separation; Delay effects; Delay estimation; Frequency domain analysis; Frequency estimation; Independent component analysis; Parameter estimation; Source separation; Time frequency analysis; Blind source separation (BSS); convolutive mixture; frequency domain; generalized cross correlation; independent component analysis (ICA); permutation problem; sparseness; time delay estimation; time–frequency (T–F) masking;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2007.899218