• DocumentCode
    1487648
  • Title

    Beyond the Narrowband Approximation: Wideband Convex Methods for Under-Determined Reverberant Audio Source Separation

  • Author

    Kowalski, Matthieu ; Vincent, Emmanuel ; Gribonval, Rémi

  • Author_Institution
    Lab. des Signaux et Syst., Univ Paris-Sud, Gif-sur-Yvette, France
  • Volume
    18
  • Issue
    7
  • fYear
    2010
  • Firstpage
    1818
  • Lastpage
    1829
  • Abstract
    We consider the problem of extracting the source signals from an under-determined convolutive mixture assuming known mixing filters. State-of-the-art methods operate in the time-frequency domain and rely on a narrowband approximation of the convolutive mixing process by complex-valued multiplication in each frequency bin. The source signals are then estimated by minimizing either a mixture fitting cost or an ℓ1 source sparsity cost, under possible constraints on the number of active sources. In this paper, we define a wideband ℓ2 mixture fitting cost circumventing the above approximation and investigate the use of an ℓ1,2 mixed-norm cost promoting disjointness of the source time-frequency representations. We design a family of convex functionals combining these costs and derive suitable optimization algorithms. Experiments indicate that the proposed wideband methods result in a signal-to-distortion ratio improvement of 2 to 5 dB compared to the state-of-the-art on reverberant speech mixtures.
  • Keywords
    audio signal processing; source separation; mixing filters; mixture fitting; narrowband approximation; signal-to-distortion ratio; source signals; under-determined convolutive mixture; under-determined reverberant audio source separation; wideband convex methods; wideband methods; Algorithm design and analysis; Cost function; Design optimization; Filters; Frequency estimation; Narrowband; Source separation; Speech; Time frequency analysis; Wideband; Convex optimization; convolutive mixture; mixed norms
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2010.2050089
  • Filename
    5462892