• DocumentCode
    865682
  • Title

    Blind Source Separation Exploiting Higher-Order Frequency Dependencies

  • Author

    Kim, Taesu ; Attias, Hagai T. ; Lee, Soo-Young ; Lee, Te-Won

  • Author_Institution
    Dept. of Biosystems, Korea Adv. Inst. of Sci. & Technol., Dajeon
  • Volume
    15
  • Issue
    1
  • fYear
    2007
  • Firstpage
    70
  • Lastpage
    79
  • Abstract
    Blind source separation (BSS) is a challenging problem in real-world environments where sources are time delayed and convolved. The problem becomes more difficult in very reverberant conditions, with an increasing number of sources, and geometric configurations of the sources such that finding directionality is not sufficient for source separation. In this paper, we propose a new algorithm that exploits higher order frequency dependencies of source signals in order to separate them when they are mixed. In the frequency domain, this formulation assumes that dependencies exist between frequency bins instead of defining independence for each frequency bin. In this manner, we can avoid the well-known frequency permutation problem. To derive the learning algorithm, we define a cost function, which is an extension of mutual information between multivariate random variables. By introducing a source prior that models the inherent frequency dependencies, we obtain a simple form of a multivariate score function. In experiments, we generate simulated data with various kinds of sources in various environments. We evaluate the performances and compare it with other well-known algorithms. The results show the proposed algorithm outperforms the others in most cases. The algorithm is also able to accurately recover six sources with six microphones. In this case, we can obtain about 16-dB signal-to-interference ratio (SIR) improvement. Similar performance is observed in real conference room recordings with three human speakers reading sentences and one loudspeaker playing music
  • Keywords
    blind source separation; frequency domain analysis; BSS; blind source separation; frequency permutation problem; higher-order frequency dependencies; multivariate random variables; multivariate score function; signal-to-interference ratio; source signals; Blind source separation; Cost function; Delay effects; Frequency domain analysis; Humans; Microphones; Mutual information; Performance evaluation; Random variables; Source separation; Blind source separation (BSS); cocktail party problem; convolutive mixture; frequency domain; higher order dependency; independent component analysis; permutation problem;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2006.872618
  • Filename
    4032777