Title :
Informed Source Separation of Linear Instantaneous Under-Determined Audio Mixtures by Source Index Embedding
Author :
Parvaix, Mathieu ; Girin, Laurent
Author_Institution :
Grenoble Lab. of Image, Speech, Signal, & Autom. (GIPSA-Lab.), Grenoble Inst. of Technol., Grenoble, France
Abstract :
In this paper, we address the issue of underdetermined source separation of I nonstationary audio sources from a J -channel linear instantaneous mixture (J <; I). This problem is addressed with a specific coder-decoder configuration. At the coder, source signals are assumed to be available before the mixing is processed. A time-frequency (TF) joint analysis of each source signal and mixture signal enables to select the subset of sources (among I ) leading to the best separation results in each TF region. A corresponding source(s) index code is imperceptibly embedded into the mix signal using a watermarking technique. At the decoder, where the original source signals are unknown, the extraction of the watermark enables to invert the mixture in each TF region to recover the source signals. With such an informed approach, it is shown that five instruments and singing voice signals can be efficiently separated from two-channel stereo mixtures, with a quality that significantly overcomes the quality obtained by a semi-blind reference method and enables separate manipulation of the source signals during stereo music restitution (i.e., remixing).
Keywords :
audio coding; audio watermarking; source coding; source separation; time-frequency analysis; J-channel linear instantaneous mixture; TF joint analysis; corresponding sources index code; informed source separation; linear instantaneous under-determined audio mixtures; mix signal; mixing; mixture signal; nonstationary audio sources; semiblind reference method; separate manipulation; singing voice signals; source index embedding; source signals; specific coder-decoder configuration; stereo music restitution; time-frequency joint analysis; two-channel stereo mixtures; underdetermined source separation; watermarking technique; Decoding; Indexes; Multiple signal classification; Source separation; Speech; Time frequency analysis; Watermarking; Audio processing; remixing; under-determined source separation; watermarking;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2010.2097250