DocumentCode :
2706510
Title :
Fast Recognition of Remixed Music Audio
Author :
Casey, Michael ; Slaney, M.
Author_Institution :
Dept. of Comput., London Univ., UK
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
We present an efficient algorithm for automatically detecting remixes of pop songs in large commercial collections. Remixes are closely related as commercial products but they are not closely related in their audio spectral content because of the nature of the remixing process. Therefore spectral modelling approaches to audio similarity fail to recognize them. We propose a new approach - that chops songs into small chunks called audio shingles - to recognize remixed songs. We model the distribution of pair-wise distances between shingles by two independent processes - one corresponding to remix content and the other corresponding to non-remix content in a database. A nearest neighbour algorithm groups songs if they share shingles drawn from the remix process. Our results show 1) log-chromagram shingles separate remixed from non-remixed content with 75%-75% precision-recall performance, cepstral coefficient features do not separate the two distributions adequately 2) increasing the observations from the remix distribution increases the separability. Efficient implementation follows from the separability of the distributions using locality sensitive hashing (LSH) which speeds up automatic grouping of remixes by between one to two orders of magnitude in a 2018-song test set.
Keywords :
audio signal processing; file organisation; music; audio shingles; audio spectral content; cepstral coefficient features; fast recognition; locality sensitive hashing; log-chromagram shingles; neighbour algorithm groups songs; pop songs; remix process; remixed music audio; spectral modelling approaches; Automatic testing; Cepstral analysis; Fingerprint recognition; Marketing and sales; Multiple signal classification; Radio navigation; Search engines; Spatial databases; Statistical distributions; Web sites; Databases; LSH; Music; Shingles; Statistics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.367347
Filename :
4218378
Link To Document :
بازگشت