• DocumentCode
    118186
  • Title

    Hybrid multichannel signal separation using supervised nonnegative matrix factorization with spectrogram restoration

  • Author

    Kitamura, Daichi ; Saruwatari, Hiroshi ; Nakamura, Satoshi ; Takahashi, Yu ; Kondo, Kazunobu ; Kameoka, Hirokazu

  • Author_Institution
    Grad. Univ. for Adv. Studies, Tokyo, Japan
  • fYear
    2014
  • fDate
    9-12 Dec. 2014
  • Firstpage
    1
  • Lastpage
    10
  • Abstract
    In this paper, we propose a new hybrid method that concatenates directional clustering and advanced nonnegative matrix factorization (NMF) to extract a specific sound from a multichannel music signal. Multichannel music signal separation aims to extract a specific target signal from observed multichannel signals that contain multiple instrumental sounds. In previous studies, various methods using NMF have been proposed, but many problems remain, e.g., poor convergence of the NMF update rules and a lack of robustness. To solve these problems, we propose a new supervised NMF (SNMF) with spectrogram restoration, together with a hybrid method that applies the proposed SNMF after directional clustering. Via extrapolation of supervised spectral bases, the proposed SNMF attempts both target signal separation and reconstruction of the target components lost in the preceding directional clustering. In addition, we theoretically reveal the trade-off between separation and extrapolation abilities and propose a new multi-divergence scheme, in which the optimal divergence can be changed automatically in each time frame according to the local spatial conditions. The results of an evaluation experiment show that the proposed hybrid method outperforms conventional music signal separation methods.
  • Keywords
    audio signal processing; extrapolation; matrix decomposition; signal reconstruction; source separation; statistical analysis; NMF; SNMF; concatenate directional clustering; extrapolation; hybrid method; hybrid multichannel music signal separation; local spatial conditions; lost target component; multidivergence scheme; multiple instrumental sounds; poor convergence; signal reconstruction; sound extraction; spectrogram restoration; supervised advance nonnegative matrix factorization; supervised spectral bases; Cost function; Extrapolation; Indexes; Instruments; Matrix decomposition; Source separation; Spectrogram;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Asia-Pacific Signal and Information Processing Association, 2014 Annual Summit and Conference (APSIPA)
  • Conference_Location
    Siem Reap
  • Type

    conf

  • DOI
    10.1109/APSIPA.2014.7041664
  • Filename
    7041664