DocumentCode
3256337
Title
Music Similarity Estimation with the Mean-Covariance Restricted Boltzmann Machine
Author
Schlüter, Jan ; Osendorfer, Christian
Author_Institution
Tech. Univ. Munchen, Munich, Germany
Volume
2
fYear
2011
fDate
18-21 Dec. 2011
Firstpage
118
Lastpage
123
Abstract
Existing content-based music similarity estimation methods largely build on complex hand-crafted feature extractors, which are difficult to engineer. As an alternative, unsupervised machine learning allows to learn features empirically from data. We train a recently proposed model, the mean-covariance Restricted Boltzmann Machine, on music spectrogram excerpts and employ it for music similarity estimation. In k-NN based genre retrieval experiments on three datasets, it clearly outperforms MFCC-based methods, beats simple unsupervised feature extraction using k-Means and comes close to the state-of-the-art. This shows that unsupervised feature extraction poses a viable alternative to engineered features.
Keywords
Boltzmann machines; content-based retrieval; feature extraction; learning (artificial intelligence); music; pattern classification; MFCC based methods; complex hand crafted feature extractors; content based music similarity estimation; k-NN based genre retrieval experiments; k-means; mean covariance restricted boltzmann machine; music spectrogram excerpts; unsupervised feature extraction; unsupervised machine learning; Data models; Estimation; Feature extraction; Hidden Markov models; Histograms; Spectrogram; Training; MIR; mcRBM; music similarity; unsupervised feature extraction;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Applications and Workshops (ICMLA), 2011 10th International Conference on
Conference_Location
Honolulu, HI
Print_ISBN
978-1-4577-2134-2
Type
conf
DOI
10.1109/ICMLA.2011.102
Filename
6147059
Link To Document