Title :
Summarizing popular music via structural similarity analysis
Author :
Cooper, Matthew ; Foote, Jonathan
Author_Institution :
FX Palo Alto Lab., CA, USA
Abstract :
We present a framework for summarizing digital media based on structural analysis. Though these methods are applicable to general media, we concentrate here on characterizing the repetitive structure in popular music. In the first step, a similarity matrix is calculated from interframe spectral similarity. Segment boundaries, such as verse-chorus transitions, are found by correlating a kernel along the diagonal of the matrix. Once segmented, spectral statistics of each segment are computed. In the second step, segments are clustered, based on the pairwise similarity of their statistics, using a matrix decomposition. Finally, the audio is summarized by combining segments representing the clusters most frequently repeated throughout the piece. We present results on a small corpus showing more than 90% correct detection of verse and chorus segments.
Keywords :
audio signal processing; matrix decomposition; music; pattern classification; pattern clustering; spectral analysis; statistical analysis; audio signal processing; chorus segments; digital media summarizing; interframe spectral similarity; matrix decomposition; popular music summarizing; segment boundaries; segment clustering; similarity matrix; spectral statistics; structural similarity analysis; verse segments; verse-chorus transitions; Aging; Digital audio players; Frequency; Home computing; Laboratories; Matrix decomposition; Multiple signal classification; Statistics; Streaming media; Technology management;
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics, 2003 IEEE Workshop on.
Print_ISBN :
0-7803-7850-4
DOI :
10.1109/ASPAA.2003.1285836