DocumentCode
594695
Title
Perceptually weighted Non-negative Matrix Factorization for blind single-channel music source separation
Author
Kirbiz, Serap ; Gunsel, B.
Author_Institution
Dept. of Electron. & Commun. Eng., Istanbul Tech. Univ., Istanbul, Turkey
fYear
2012
fDate
11-15 Nov. 2012
Firstpage
226
Lastpage
229
Abstract
We propose a blind single-channel musical source separation method that improves perceptual quality of the separated sources. It uses the advantages of subspace learning based on Non-negative Matrix Factor 2-D Deconvolution (NMF2D). To improve the perceptual quality of separation, we propose a weighted divergence type cost function for the optimization that adopts the auditory model defined in ITU-R BS.1387 into the source separation. It is shown that the proposed perceptually weighted NMF2D scheme efficiently clusters the bases of subspace representation corresponding to notes generated by single instruments. Source separation performance has been reported on musical mixtures resulting an improvement in perceptual quality measures.
Keywords
deconvolution; learning (artificial intelligence); matrix decomposition; music; source separation; 2D deconvolution; NMF2D; blind single-channel music source separation; non-negative matrix factorization; perceptual quality; subspace learning; Computational complexity; Humans; Instruments; Source separation; Spectrogram; Time domain analysis; Time frequency analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (ICPR), 2012 21st International Conference on
Conference_Location
Tsukuba
ISSN
1051-4651
Print_ISBN
978-1-4673-2216-4
Type
conf
Filename
6460113
Link To Document