Title :
Structural Segmentation of Musical Audio by Constrained Clustering
Author :
Levy, Mark ; Sandler, Mark
Author_Institution :
Dept. of Electron. Eng., Queen Mary Univ. of London, London
Abstract :
We describe a method of segmenting musical audio into structural sections based on a hierarchical labeling of spectral features. Frames of audio are first labeled as belonging to one of a number of discrete states using a hidden Markov model trained on the features. Histograms of neighboring frames are then clustered into segment-types representing distinct distributions of states, using a clustering algorithm in which temporal continuity is expressed as a set of constraints modeled by a hidden Markov random field. We give experimental results which show that in many cases the resulting segmentations correspond well to conventional notions of musical form. We show further how the constrained clustering approach can easily be extended to include prior musical knowledge, input from other machine approaches, or semi-supervision.
Keywords :
audio signal processing; hidden Markov models; music; clustering algorithm; constrained clustering; hidden Markov model; hierarchical labeling; musical audio; structural segmentation; Audio recording; Bridges; Clustering algorithms; Hidden Markov models; Histograms; Instruments; Labeling; Multiple signal classification; Music information retrieval; Pattern recognition; Audio; clustering; music; segmentation;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2007.910781