• DocumentCode
    57086
  • Title

    Multichannel Signal Separation Combining Directional Clustering and Nonnegative Matrix Factorization with Spectrogram Restoration

  • Author

    Kitamura, Daichi ; Saruwatari, Hiroshi ; Kameoka, Hirokazu ; Takahashi, Yu. ; Kondo, Kazunobu ; Nakamura, Satoshi

  • Author_Institution
    Dept. of Inf., Grad. Univ. for Adv. Studies(SOKENDAI), Tokyo, Japan
  • Volume
    23
  • Issue
    4
  • fYear
    2015
  • fDate
    Apr-15
  • Firstpage
    654
  • Lastpage
    669
  • Abstract
    In this paper, to address problems in multichannel music signal separation, we propose a new hybrid method that combines directional clustering and advanced nonnegative matrix factorization (NMF). The aims of multichannel music signal separation technology is to extract a specific target signal from observed multichannel signals that contain multiple instrumental sounds. In previous studies, various methods using NMF have been proposed, but many problems remain including poor separation accuracy and lack of robustness. To solve these problems, we propose a new supervised NMF (SNMF) with spectrogram restoration and a hybrid method that concatenates the proposed SNMF after directional clustering. Via the extrapolation of supervised spectral bases, the proposed SNMF attempts both target signal separation and reconstruction of the lost target components, which are generated by preceding directional clustering. In addition, we experimentally reveal the trade-off between separation and extrapolation abilities and propose a new scheme for adaptive divergence, where the optimal divergence can be automatically changed in each time frame according to the local spatial conditions. The results of an evaluation experiment show that our proposed hybrid method outperforms the conventional music signal separation methods.
  • Keywords
    extrapolation; matrix decomposition; signal reconstruction; signal restoration; source separation; adaptive divergence; advanced nonnegative matrix factorization; directional clustering; extrapolation; multichannel music signal separation; multichannel music signal separation technology; multichannel signal separation combining directional clustering; music signal separation methods; nonnegative matrix factorization; optimal divergence; spectrogram restoration; supervised NMF; supervised spectral bases; target signal reconstruction; target signal separation; Extrapolation; Matrix decomposition; Source separation; Sparse matrices; Spectrogram; Speech; Speech processing; Multichannel signal separation; music signal processing; nonnegative matrix factorization (NMF); spectrogram restoration;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2015.2401425
  • Filename
    7035017