DocumentCode
2802334
Title
Summarizing popular music via structural similarity analysis
Author
Cooper, Matthew ; Foote, Jonathan
Author_Institution
FX Palo Alto Lab., CA, USA
fYear
2003
fDate
19-22 Oct. 2003
Firstpage
127
Lastpage
130
Abstract
We present a framework for summarizing digital media based on structural analysis. Though these methods are applicable to general media, we concentrate here on characterizing the repetitive structure in popular music. In the first step, a similarity matrix is calculated from interframe spectral similarity. Segment boundaries, such as verse-chorus transitions, are found by correlating a kernel along the diagonal of the matrix. Once segmented, spectral statistics of each segment are computed. In the second step, segments are clustered, based on the pairwise similarity of their statistics, using a matrix decomposition. Finally, the audio is summarized by combining segments representing the clusters most frequently repeated throughout the piece. We present results on a small corpus showing more than 90% correct detection of verse and chorus segments.
Keywords
audio signal processing; matrix decomposition; music; pattern classification; pattern clustering; spectral analysis; statistical analysis; audio signal processing; chorus segments; digital media summarizing; interframe spectral similarity; matrix decomposition; popular music summarizing; segment boundaries; segment clustering; similarity matrix; spectral statistics; structural similarity analysis; verse segments; verse-chorus transitions; Aging; Digital audio players; Frequency; Home computing; Laboratories; Matrix decomposition; Multiple signal classification; Statistics; Streaming media; Technology management;
fLanguage
English
Publisher
ieee
Conference_Titel
Applications of Signal Processing to Audio and Acoustics, 2003 IEEE Workshop on.
Print_ISBN
0-7803-7850-4
Type
conf
DOI
10.1109/ASPAA.2003.1285836
Filename
1285836
Link To Document