Title :
Towards Effective Content-Based Music Retrieval With Multiple Acoustic Feature Combination
Author :
Shen, Jialie ; Shepherd, John ; Ngu, Anne H H
Author_Institution :
Sch. of Comput. Sci. & Eng., New South Wales Univ., Sydney, NSW
Abstract :
In this paper, we present a new approach to constructing music descriptors to support efficient content-based music retrieval and classification. The system applies multiple musical properties combined with a hybrid architecture based on principal component analysis (PCA) and a multilayer perceptron neural network. This architecture enables straightforward incorporation of multiple musical feature vectors, based on properties such as timbral texture, pitch, and rhythm structure, into a single low-dimensioned vector that is more effective for classification than the larger individual feature vectors. The use of supervised training enables incorporation of human musical perception that further enhances the classification process. We compare our approach with state of the art techniques and demonstrate its effectiveness on content-based music retrieval. In addition, extensive experimental study illustrates its effectiveness and robustness against various kinds of audio alteration
Keywords :
acoustic signal processing; audio databases; content-based retrieval; learning (artificial intelligence); multilayer perceptrons; multimedia databases; music; pattern classification; principal component analysis; PCA; audio alteration; content-based music retrieval; human musical perception; multilayer perceptron neural network; multimedia database; multiple acoustic feature combination; music classification; music descriptor; principal component analysis; supervised training; Computer science; Content based retrieval; Humans; Multilayer perceptrons; Multiple signal classification; Music information retrieval; Neural networks; Principal component analysis; Rhythm; Robustness; Classification; multimedia database; music retrieval;
Journal_Title :
Multimedia, IEEE Transactions on
DOI :
10.1109/TMM.2006.884618