• DocumentCode
    1037643
  • Title

    Musical Genre Classification Using Nonnegative Matrix Factorization-Based Features

  • Author

    Holzapfel, André ; Stylianou, Yannis

  • Author_Institution
    Inst. of Comput. Sci., Heraklion
  • Volume
    16
  • Issue
    2
  • fYear
    2008
  • Firstpage
    424
  • Lastpage
    434
  • Abstract
    Nonnegative matrix factorization (NMF) is used to derive a novel description for the timbre of musical sounds. Using NMF, a spectrogram is factorized providing a characteristic spectral basis. Assuming a set of spectrograms given a musical genre, the space spanned by the vectors of the obtained spectral bases is modeled statistically using mixtures of Gaussians, resulting in a description of the spectral base for this musical genre. This description is shown to improve classification results by up to 23.3% compared to MFCC-based models, while the compression performed by the factorization decreases training time significantly. Using a distance-based stability measure this compression is shown to reduce the noise present in the data set resulting in more stable classification models. In addition, we compare the mean squared errors of the approximation to a spectrogram using independent component analysis and nonnegative matrix factorization, showing the superiority of the latter approach.
  • Keywords
    Gaussian processes; audio signal processing; independent component analysis; matrix decomposition; mean square error methods; music; signal classification; spectral analysis; Gaussian mixture; distance-based stability measure; independent component analysis; mean squared error; musical genre classification; musical sound timbre; nonnegative matrix factorization-based feature; spectral basis; spectrogram approximation; Computer science; Digital recording; Gaussian processes; Instruments; Internet; Mood; Music information retrieval; Rhythm; Spectrogram; Timbre; Audio classification; audio feature extraction; music information retrieval; nonnegative matrix factorization;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2007.909434
  • Filename
    4432640