• DocumentCode
    3403787
  • Title

    Cross-correlation of beat-synchronous representations for music similarity

  • Author

    Ellis, Daniel P W ; Cotton, Courtenay V. ; Mandel, Michael I.

  • Author_Institution
    Dept. of Electr. Eng., Columbia Univ., New York, NY
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    57
  • Lastpage
    60
  • Abstract
    Systems to predict human judgments of music similarity directly from the audio have generally been based on the global statistics of spectral feature vectors i.e. collapsing any large-scale temporal structure in the data. Based on our work in identifying alternative ("cover") versions of pieces, we investigate using direct correlation of beat-synchronous representations of music audio to find segments that are similar not only in feature statistics, but in the relative positioning of those features in tempo-normalized time. Given a large enough search database, good matches by this metric should have very high perceived similarity to query items. We evaluate our system through a listening test in which subjects rated system-generated matches as similar or not similar, and compared results to a more conventional timbral and rhythmic similarity baseline, and to random selections.
  • Keywords
    acoustic correlation; audio databases; audio signal processing; dynamic programming; music; query processing; acoustic signal analysis; beat-synchronous representation; cross-correlation; dynamic programming; music audio similarity prediction; query items; search database; tempo-normalized time; Acoustic testing; Cotton; Humans; Instruments; Large-scale systems; Multiple signal classification; Music; Spatial databases; Statistics; System testing; Acoustic signal analysis; Correlation; Database searching; Dynamic programming; Music;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4517545
  • Filename
    4517545