• DocumentCode
    2537701
  • Title

    Musical instrument classification using non-negative matrix factorization algorithms

  • Author

    Benetos, Emmanouil ; Kotti, Margarita ; Kotropoulos, Constantine

  • Author_Institution
    Dept. of Informatics, Aristotle Univ., Thessaloniki
  • fYear
    2006
  • fDate
    21-24 May 2006
  • Abstract
    In this paper, a class of algorithms for automatic classification of individual musical instrument sounds is presented. Several perceptual features used in general sound classification applications were measured for 300 sound recordings consisting of 6 different musical instrument classes (piano, violin, cello, flute, bassoon and soprano saxophone). In addition, MPEG-7 basic spectral and spectral basis descriptors were considered, providing an effective combination for accurately describing the spectral and timbral audio characteristics. The audio files were split using 70% of the available data for training and the remaining 30% for testing. A classifier was developed based on non-negative matrix factorization (NMF) techniques, thus introducing a novel application of NMF. The standard NMF method was examined, as well as its modifications: the local, the sparse, and the discriminant NMF. Experimental results are presented to compare MPEG-7 spectral basis representations with MPEG-7 basic spectral features alongside the various NMF algorithms. The results indicate that the use of the spectrum projection coefficients for feature extraction and the standard NMF classifier yields an accuracy exceeding 95%
  • Keywords
    audio signal processing; matrix decomposition; musical instruments; signal classification; spectral analysis; MPEG-7 basic spectral descriptors; MPEG-7 basic spectral features; MPEG-7 spectral basis representations; NMF classifier; automatic sound classification; musical instrument classification; musical instrument sounds; nonnegative matrix factorization algorithm; spectral basis descriptors; spectral characteristics; spectrum projection coefficient; timbral audio characteristics; Application specific processors; Artificial intelligence; Audio databases; Electronic mail; Informatics; Information analysis; Instruments; Laboratories; MPEG 7 Standard; Spatial databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Circuits and Systems, 2006. ISCAS 2006. Proceedings. 2006 IEEE International Symposium on
  • Conference_Location
    Island of Kos
  • Print_ISBN
    0-7803-9389-9
  • Type

    conf

  • DOI
    10.1109/ISCAS.2006.1692967
  • Filename
    1692967