• DocumentCode
    463722
  • Title

    A Statistical Approach to Musical Genre Classification using Non-Negative Matrix Factorization

  • Author

    Holzapfel, A. ; Stylianou, Yannis

  • Author_Institution
    Dept. of Comput. Sci., Crete Univ., Greece
  • Volume
    2
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    This paper introduces a new feature set based on a non-negative matrix factorization approach for the classification of musical signals into genres, only using synchronous organization of music events (vertical dimension of music). This feature set generates a vector space to describe the spectrogram representation of a music signal. The space is modeled statistically by a mixture of Gaussians (GMM). A new signal is classified by considering the likelihoods over all the estimated feature vectors given these statistical models, without constructing a model for the signal itself. Cross-validation tests on two commonly utilized datasets for this task show the superiority of the proposed features compared to the widely used MFCC type of representation based on classification accuracies (over 9% of improvement), as well as on a stability measure introduced in this paper for GMM.
  • Keywords
    Gaussian processes; audio signal processing; matrix decomposition; statistical analysis; mixture of Gaussians; musical genre classification; musical signal classification; nonnegative matrix factorization; spectrogram representation; statistical approach; Computer science; Gaussian processes; Instruments; Mel frequency cepstral coefficient; Multiple signal classification; Music; Rhythm; Signal generators; Spatial databases; Spectrogram; Gaussian Mixture Model; MFCC; Music Genre Classification; Non-negative Matrix Factorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.366330
  • Filename
    4217503