DocumentCode :
1037728
Title :
Structural Segmentation of Musical Audio by Constrained Clustering
Author :
Levy, Mark ; Sandler, Mark
Author_Institution :
Dept. of Electron. Eng., Queen Mary Univ. of London, London
Volume :
16
Issue :
2
fYear :
2008
Firstpage :
318
Lastpage :
326
Abstract :
We describe a method of segmenting musical audio into structural sections based on a hierarchical labeling of spectral features. Frames of audio are first labeled as belonging to one of a number of discrete states using a hidden Markov model trained on the features. Histograms of neighboring frames are then clustered into segment-types representing distinct distributions of states, using a clustering algorithm in which temporal continuity is expressed as a set of constraints modeled by a hidden Markov random field. We give experimental results which show that in many cases the resulting segmentations correspond well to conventional notions of musical form. We show further how the constrained clustering approach can easily be extended to include prior musical knowledge, input from other machine approaches, or semi-supervision.
Keywords :
audio signal processing; hidden Markov models; music; clustering algorithm; constrained clustering; hidden Markov model; hierarchical labeling; musical audio; structural segmentation; Audio recording; Bridges; Clustering algorithms; Hidden Markov models; Histograms; Instruments; Labeling; Multiple signal classification; Music information retrieval; Pattern recognition; Audio; clustering; music; segmentation;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2007.910781
Filename :
4432648
Link To Document :
بازگشت