Beyond the Narrowband Approximation: Wideband Convex Methods for Under-Determined Reverberant Audio Source Separation

Author

Kowalski, Matthieu ; Vincent, Emmanuel ; Gribonval, Rémi

Author_Institution

Lab. des Signaux et Syst., Univ Paris-Sud, Gif-sur-Yvette, France

Volume

18

Issue

7

fYear

2010

Firstpage

1818

Lastpage

1829

Abstract

We consider the problem of extracting the source signals from an under-determined convolutive mixture assuming known mixing filters. State-of-the-art methods operate in the time-frequency domain and rely on narrowband approximation of the convolutive mixing process by complex-valued multiplication in each frequency bin. The source signals are then estimated by minimizing either a mixture fitting cost or a ℓ₁ source sparsity cost, under possible constraints on the number of active sources. In this paper, we define a wideband ℓ₂ mixture fitting cost circumventing the above approximation and investigate the use of a ℓ_1,2 mixed-norm cost promoting disjointness of the source time-frequency representations. We design a family of convex functionals combining these costs and derive suitable optimization algorithms. Experiments indicate that the proposed wideband methods result in a signal-to-distortion ratio improvement of 2 to 5 dB compared to the state-of-the-art on reverberant speech mixtures.

Keywords

audio signal processing; source separation; mixing filters; mixture fitting; narrowband approximation; signal-to-distortion ratio; source signals; under-determined convolutive mixture; under-determined reverberant audio source separation; wideband convex methods; wideband methods; Algorithm design and analysis; Cost function; Design optimization; Filters; Frequency estimation; Narrowband; Source separation; Speech; Time frequency analysis; Wideband; Convex optimization; convolutive mixture; mixed norms; narrowband approximation; source separation;

fLanguage

English

Journal_Title

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher

ieee

ISSN

1558-7916

Type

jour

DOI

10.1109/TASL.2010.2050089

Filename

5462892