• DocumentCode
    2372534
  • Title

    Initialization of nonnegative matrix factorization dictionaries for single channel source separation

  • Author

    Grais, E.M. ; Erdogan, H.

  • Author_Institution
    Fac. of Eng. & Natural Sci., Sabanci Univ., Istanbul, Turkey
  • fYear
    2013
  • fDate
    24-26 April 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In this work, we study different initialization methods for the nonnegative matrix factorization (NMF) dictionaries or bases. There is a need for good initializations for NMF dictionary because NMF decomposition is a non-convex problem which has many local minima. The effect of the initialization of NMF is evaluated in this work on audio source separation applications. In supervised audio source separation, NMF is used to train a set of basis vectors (basis matrix) for each source in an iterative fashion. Then NMF is used to decompose the mixed signal spectrogram as a weighted linear combination of the trained basis vectors for all sources in the mixed signal. The estimate for each source is computed by summing the decomposition terms that include its corresponding trained bases. In this work, we use principal component analysis (PCA), spherical K-means, and fuzzy C-means (FCM) to initialize the NMF basis matrices during the training procedures. Experimental results show that, better initialization for NMF bases gives better audio separation performance than using NMF with random initialization.
  • Keywords
    audio signal processing; concave programming; fuzzy set theory; iterative methods; learning (artificial intelligence); principal component analysis; random processes; singular value decomposition; source separation; vectors; FCM; NMF basis matrix; NMF decomposition; NMF dictionary; audio source separation; basis vector training; basis vectors; channel source separation; fuzzy C-means; iterative method; mixed signal spectrogram decomposition; nonconvex problem; nonnegative matrix factorization; principal component analysis; random initialization; spherical K-means; weighted linear combination; Matrix decomposition; Principal component analysis; Source separation; Spectrogram; Speech; Training; Vectors; Nonnegative matrix factorization; data clustering; dictionary learning; fuzzy clustering; principal component analysis; single channel source separation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Communications Applications Conference (SIU), 2013 21st
  • Conference_Location
    Haspolat
  • Print_ISBN
    978-1-4673-5562-9
  • Electronic_ISBN
    978-1-4673-5561-2
  • Type

    conf

  • DOI
    10.1109/SIU.2013.6531172
  • Filename
    6531172