• DocumentCode
    1469236
  • Title

    Unifying Low-Level and High-Level Music Similarity Measures

  • Author

    Bogdanov, Dmitry ; Serrà, Joan ; Wack, Nicolas ; Herrera, Perfecto ; Serra, Xavier

  • Author_Institution
    Music Technol. Group, Univ. Pompeu Fabra, Barcelona, Spain
  • Volume
    13
  • Issue
    4
  • fYear
    2011
  • Firstpage
    687
  • Lastpage
    701
  • Abstract
    Measuring music similarity is essential for multimedia retrieval. For music items, this task can be regarded as obtaining a suitable distance measurement between songs defined on a certain feature space. In this paper, we propose three of such distance measures based on the audio content: first, a low-level measure based on tempo-related description; second, a high-level semantic measure based on the inference of different musical dimensions by support vector machines. These dimensions include genre, culture, moods, instruments, rhythm, and tempo annotations. Third, a hybrid measure which combines the above-mentioned distance measures with two existing low-level measures: a Euclidean distance based on principal component analysis of timbral, temporal, and tonal descriptors, and a timbral distance based on single Gaussian Mel-frequency cepstral coefficient (MFCC) modeling. We evaluate our proposed measures against a number of baseline measures. We do this objectively based on a comprehensive set of music collections, and subjectively based on listeners´ ratings. Results show that the proposed methods achieve accuracies comparable to the baseline approaches in the case of the tempo and classifier-based measures. The highest accuracies are obtained by the hybrid distance. Furthermore, the proposed classifier-based approach opens up the possibility to explore distance measures that are based on semantic notions.
  • Keywords
    cepstral analysis; distance measurement; geometry; information retrieval; multimedia computing; music; principal component analysis; support vector machines; Euclidean distance; Gaussian mel-frequency cepstral coefficient modeling; audio content:; culture; distance measurement; genre; instruments; moods; multimedia retrieval; music similarity measures; musical dimensions; principal component analysis; rhythm; support vector machines; tempo annotations; tempo-related description; temporal; timbral distance; tonal descriptors; Euclidean distance; Mel frequency cepstral coefficient; Mood; Principal component analysis; Semantics; Support vector machines; Distance measurement; information retrieval; knowledge acquisition; multimedia computing; multimedia databases; music;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2011.2125784
  • Filename
    5728926