DocumentCode
78128
Title
A Simple Method to Determine if a Music Information Retrieval System is a “Horse”
Author
Sturm, Bob L.
Author_Institution
Audio Anal. Lab., Aalborg Univ. Copenhagen, Copenhagen, Denmark
Volume
16
Issue
6
fYear
2014
fDate
Oct. 2014
Firstpage
1636
Lastpage
1644
Abstract
We propose and demonstrate a simple method to explain the figure of merit (FoM) of a music information retrieval (MIR) system evaluated in a dataset, specifically, whether the FoM comes from the system using characteristics confounded with the “ground truth” of the dataset. Akin to the controlled experiments designed to test the supposed mathematical ability of the famous horse “Clever Hans,” we perform two experiments to show how three state-of-the-art MIR systems produce excellent FoM in spite of not using musical knowledge. This provides avenues for improving MIR systems, as well as their evaluation. We make available a reproducible research package so that others can apply the same method to evaluating other MIR systems.
Keywords
information retrieval systems; music; Clever Hans; FoM; MIR evaluation; MIR system; dataset ground truth; figure-of-merit; music information retrieval system; musical knowledge; Accuracy; Feature extraction; Multiple signal classification; Semantics; Silicon; Standards; Vocabulary; 2-WORK system performance; 5-CONT content description and annotation; 5-SEAR multimedia search and retrieval;
fLanguage
English
Journal_Title
Multimedia, IEEE Transactions on
Publisher
ieee
ISSN
1520-9210
Type
jour
DOI
10.1109/TMM.2014.2330697
Filename
6847693
Link To Document