DocumentCode
2537701
Title
Musical instrument classification using non-negative matrix factorization algorithms
Author
Benetos, Emmanouil ; Kotti, Margarita ; Kotropoulos, Constantine
Author_Institution
Dept. of Informatics, Aristotle Univ., Thessaloniki
fYear
2006
fDate
21-24 May 2006
Abstract
In this paper, a class of algorithms for automatic classification of individual musical instrument sounds is presented. Several perceptual features used in general sound classification applications were measured for 300 sound recordings consisting of 6 different musical instrument classes (piano, violin, cello, flute, bassoon and soprano saxophone). In addition, MPEG-7 basic spectral and spectral basis descriptors were considered, providing an effective combination for accurately describing the spectral and timbral audio characteristics. The audio files were split using 70% of the available data for training and the remaining 30% for testing. A classifier was developed based on non-negative matrix factorization (NMF) techniques, thus introducing a novel application of NMF. The standard NMF method was examined, as well as its modifications: the local, the sparse, and the discriminant NMF. Experimental results are presented to compare MPEG-7 spectral basis representations with MPEG-7 basic spectral features alongside the various NMF algorithms. The results indicate that the use of the spectrum projection coefficients for feature extraction and the standard NMF classifier yields an accuracy exceeding 95%.
Keywords
audio signal processing; matrix decomposition; musical instruments; signal classification; spectral analysis; MPEG-7 basic spectral descriptors; MPEG-7 basic spectral features; MPEG-7 spectral basis representations; NMF classifier; automatic sound classification; musical instrument classification; musical instrument sounds; nonnegative matrix factorization algorithm; spectral basis descriptors; spectral characteristics; spectrum projection coefficient; timbral audio characteristics; Application specific processors; Artificial intelligence; Audio databases; Electronic mail; Informatics; Information analysis; Instruments; Laboratories; MPEG 7 Standard; Spatial databases
fLanguage
English
Publisher
IEEE
Conference_Titel
Circuits and Systems, 2006. ISCAS 2006. Proceedings. 2006 IEEE International Symposium on
Conference_Location
Island of Kos
Print_ISBN
0-7803-9389-9
Type
conf
DOI
10.1109/ISCAS.2006.1692967
Filename
1692967