مرکز منطقه ای اطلاع رساني علوم و فناوري - Kernel spectrogram models for source separation

DocumentCode :

164807

Title :

Kernel spectrogram models for source separation

Author :

Liutkus, Antoine ; Rafii, Zafar ; Pardo, Bryan ; Fitzgerald, D. ; Daudet, Laurent

Author_Institution :

Inria, Villers-lès-Nancy, France

fYear :

2014

fDate :

12-14 May 2014

Firstpage :

Lastpage :

Abstract :

In this study, we introduce a new framework called Kernel Additive Modelling for audio spectrograms that can be used for multichannel source separation. It assumes that the spectrogram of a source at any time-frequency bin is close to its value in a neighbourhood indicated by a source-specific proximity kernel. The rationale for this model is to easily account for features like periodicity, stability over time or frequency, self-similarity, etc. In many cases, such local dynamics are indeed much more natural to assess than any global model such as a tensor factorization. This framework permits one to use different proximity kernels for different sources and to estimate them blindly using their mixtures only. Estimation is performed using a variant of the kernel backfitting algorithm that allows for multichannel mixtures and permits parallelization. Experimental results on the separation of vocals from musical backgrounds demonstrate the efficiency of the approach.

Keywords :

audio signal processing; source separation; Kernel additive modelling; Kernel spectrogram models; audio spectrograms; kernel backfitting algorithm; multichannel mixtures; multichannel source separation; musical backgrounds; source separation; tensor factorization; time-frequency; vocal separation; Conferences; Kernel; Source separation; Spectrogram; Speech; Speech processing; Time-frequency analysis; audio source separation; spatial filtering; spectrogram models;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Hands-free Speech Communication and Microphone Arrays (HSCMA), 2014 4th Joint Workshop on

Conference_Location :

Villers-les-Nancy

Type :

conf

DOI :

10.1109/HSCMA.2014.6843240

Filename :

6843240

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=164807