DocumentCode :
57086
Title :
Multichannel Signal Separation Combining Directional Clustering and Nonnegative Matrix Factorization with Spectrogram Restoration
Author :
Kitamura, Daichi ; Saruwatari, Hiroshi ; Kameoka, Hirokazu ; Takahashi, Yu. ; Kondo, Kazunobu ; Nakamura, Satoshi
Author_Institution :
Dept. of Inf., Grad. Univ. for Adv. Studies(SOKENDAI), Tokyo, Japan
Volume :
23
Issue :
4
fYear :
2015
fDate :
Apr-15
Firstpage :
654
Lastpage :
669
Abstract :
In this paper, to address problems in multichannel music signal separation, we propose a new hybrid method that combines directional clustering and advanced nonnegative matrix factorization (NMF). The aims of multichannel music signal separation technology is to extract a specific target signal from observed multichannel signals that contain multiple instrumental sounds. In previous studies, various methods using NMF have been proposed, but many problems remain including poor separation accuracy and lack of robustness. To solve these problems, we propose a new supervised NMF (SNMF) with spectrogram restoration and a hybrid method that concatenates the proposed SNMF after directional clustering. Via the extrapolation of supervised spectral bases, the proposed SNMF attempts both target signal separation and reconstruction of the lost target components, which are generated by preceding directional clustering. In addition, we experimentally reveal the trade-off between separation and extrapolation abilities and propose a new scheme for adaptive divergence, where the optimal divergence can be automatically changed in each time frame according to the local spatial conditions. The results of an evaluation experiment show that our proposed hybrid method outperforms the conventional music signal separation methods.
Keywords :
extrapolation; matrix decomposition; signal reconstruction; signal restoration; source separation; adaptive divergence; advanced nonnegative matrix factorization; directional clustering; extrapolation; multichannel music signal separation; multichannel music signal separation technology; multichannel signal separation combining directional clustering; music signal separation methods; nonnegative matrix factorization; optimal divergence; spectrogram restoration; supervised NMF; supervised spectral bases; target signal reconstruction; target signal separation; Extrapolation; Matrix decomposition; Source separation; Sparse matrices; Spectrogram; Speech; Speech processing; Multichannel signal separation; music signal processing; nonnegative matrix factorization (NMF); spectrogram restoration;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
2329-9290
Type :
jour
DOI :
10.1109/TASLP.2015.2401425
Filename :
7035017
Link To Document :
بازگشت