Title :
Reverberant audio source separation using partially pre-trained nonnegative matrix factorization
Author :
Fakhry, Mahmoud ; Svaizer, Piergiorgio ; Omologo, Maurizio
Author_Institution :
Doctoral Sch. of ICT, Univ. of Trento, Trento, Italy
Abstract :
This work addresses the problem of underdetermined audio source separation exploiting source-based prior information. To solve the problem by following local Gaussian modeling of a mixing process, the covariance matrix of a mixture of audio sources is parameterized by source variances and spatial covariance matrices. An iterative algorithm is proposed to estimate the model parameters in the maximum likelihood (ML) sense by reformulating the parameters and using the prior information. Applying a matrix factorization algorithm, such as nonnegative matrix factorization (NMF), the source variance can be represented as multiplication of two matrices, a spectral basis matrix and a time-varying coefficient matrix. The basis matrix of each source signal is trained in advance using a set of training data, while a corrupted copy of the coefficient matrix is estimated during the separation process. Moreover, the pre-trained information is exploited to move the amplitude indeterminacy of the spatial covariance matrix to the coefficient matrix domain. The proposed algorithm was evaluated using simulated and real mixing conditions, and it provided a high performance in reverberant environments.
Keywords :
Gaussian processes; audio signal processing; covariance matrices; iterative methods; matrix decomposition; maximum likelihood estimation; reverberation; source separation; iterative algorithm; local Gaussian modeling; maximum likelihood estimation; nonnegative matrix factorization; prior information; reverberant audio source separation; source variance; spatial covariance matrix; spectral basis matrix; time-varying coefficient matrix domain; Acoustics; Covariance matrices; Microphones; Minimization; Source separation; Speech; Time-frequency analysis; Prior information; audio source separation; coefficient matrix domain; nonnegative matrix factorization;
Conference_Titel :
Acoustic Signal Enhancement (IWAENC), 2014 14th International Workshop on
Conference_Location :
Juan-les-Pins
DOI :
10.1109/IWAENC.2014.6954301