  • DocumentCode
    78128
  • Title
    A Simple Method to Determine if a Music Information Retrieval System is a “Horse”
  • Author
    Sturm, Bob L.
  • Author_Institution
    Audio Anal. Lab., Aalborg Univ. Copenhagen, Copenhagen, Denmark
  • Volume
    16
  • Issue
    6
  • fYear
    2014
  • fDate
    Oct. 2014
  • Firstpage
    1636
  • Lastpage
    1644
  • Abstract
    We propose and demonstrate a simple method to explain the figure of merit (FoM) of a music information retrieval (MIR) system evaluated in a dataset, specifically, whether the FoM comes from the system using characteristics confounded with the “ground truth” of the dataset. Akin to the controlled experiments designed to test the supposed mathematical ability of the famous horse “Clever Hans,” we perform two experiments to show how three state-of-the-art MIR systems produce excellent FoM in spite of not using musical knowledge. This provides avenues for improving MIR systems, as well as their evaluation. We make available a reproducible research package so that others can apply the same method to evaluating other MIR systems.
  • Keywords
    information retrieval systems; music; Clever Hans; FoM; MIR evaluation; MIR system; dataset ground truth; figure-of-merit; music information retrieval system; musical knowledge; Accuracy; Feature extraction; Multiple signal classification; Semantics; Silicon; Standards; Vocabulary; 2-WORK system performance; 5-CONT content description and annotation; 5-SEAR multimedia search and retrieval
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1520-9210
  • Type
    jour
  • DOI
    10.1109/TMM.2014.2330697
  • Filename
    6847693