Title :
Blind Source Separation Exploiting Higher-Order Frequency Dependencies
Author :
Kim, Taesu ; Attias, Hagai T. ; Lee, Soo-Young ; Lee, Te-Won
Author_Institution :
Dept. of Biosystems, Korea Adv. Inst. of Sci. & Technol., Dajeon
Abstract :
Blind source separation (BSS) is a challenging problem in real-world environments where sources are time delayed and convolved. The problem becomes more difficult in very reverberant conditions, with an increasing number of sources, and geometric configurations of the sources such that finding directionality is not sufficient for source separation. In this paper, we propose a new algorithm that exploits higher order frequency dependencies of source signals in order to separate them when they are mixed. In the frequency domain, this formulation assumes that dependencies exist between frequency bins instead of defining independence for each frequency bin. In this manner, we can avoid the well-known frequency permutation problem. To derive the learning algorithm, we define a cost function, which is an extension of mutual information between multivariate random variables. By introducing a source prior that models the inherent frequency dependencies, we obtain a simple form of a multivariate score function. In experiments, we generate simulated data with various kinds of sources in various environments. We evaluate the performances and compare it with other well-known algorithms. The results show the proposed algorithm outperforms the others in most cases. The algorithm is also able to accurately recover six sources with six microphones. In this case, we can obtain about 16-dB signal-to-interference ratio (SIR) improvement. Similar performance is observed in real conference room recordings with three human speakers reading sentences and one loudspeaker playing music
Keywords :
blind source separation; frequency domain analysis; BSS; blind source separation; frequency permutation problem; higher-order frequency dependencies; multivariate random variables; multivariate score function; signal-to-interference ratio; source signals; Blind source separation; Cost function; Delay effects; Frequency domain analysis; Humans; Microphones; Mutual information; Performance evaluation; Random variables; Source separation; Blind source separation (BSS); cocktail party problem; convolutive mixture; frequency domain; higher order dependency; independent component analysis; permutation problem;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2006.872618