Title :
Cross-correlation of beat-synchronous representations for music similarity
Author :
Ellis, Daniel P W ; Cotton, Courtenay V. ; Mandel, Michael I.
Author_Institution :
Dept. of Electr. Eng., Columbia Univ., New York, NY
fDate :
March 31 2008-April 4 2008
Abstract :
Systems to predict human judgments of music similarity directly from the audio have generally been based on the global statistics of spectral feature vectors i.e. collapsing any large-scale temporal structure in the data. Based on our work in identifying alternative ("cover") versions of pieces, we investigate using direct correlation of beat-synchronous representations of music audio to find segments that are similar not only in feature statistics, but in the relative positioning of those features in tempo-normalized time. Given a large enough search database, good matches by this metric should have very high perceived similarity to query items. We evaluate our system through a listening test in which subjects rated system-generated matches as similar or not similar, and compared results to a more conventional timbral and rhythmic similarity baseline, and to random selections.
Keywords :
acoustic correlation; audio databases; audio signal processing; dynamic programming; music; query processing; acoustic signal analysis; beat-synchronous representation; cross-correlation; dynamic programming; music audio similarity prediction; query items; search database; tempo-normalized time; Acoustic testing; Cotton; Humans; Instruments; Large-scale systems; Multiple signal classification; Music; Spatial databases; Statistics; System testing; Acoustic signal analysis; Correlation; Database searching; Dynamic programming; Music;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4517545