Title :
Audio segmentation by feature-space clustering using linear discriminant analysis and dynamic programming
Author :
Goodwin, Michael M. ; Laroche, Jean
Author_Institution :
Creative Adv. Technol. Center, Scotts Valley, CA, USA
Abstract :
We consider the problem of segmenting an audio signal into characteristic regions based on feature-set similarities. In the proposed method, a feature-space representation of the signal is generated; then, sequences of feature-space samples are aggregated into clusters corresponding to distinct signal regions. The clustering of feature sets is improved via linear discriminant analysis (LDA); dynamic programming (DP) is used to derive optimal cluster boundaries. The method avoids the heuristics employed in various feature-space segmentation schemes and is able to derive an optimal segmentation once the LDA and DP cost metrics have been chosen. We demonstrate that the method outperforms typical feature-space approaches described in the literature. We focus on an illustrative example of the basic segmentation task; however, by judicious design of the feature set, the training set, and the dynamic program, the method can be tailored for various applications such as speech/music discrimination, segmentation of audio streams for smart transport, or song structure analysis for thumbnailing.
Keywords :
audio signal processing; dynamic programming; pattern classification; pattern clustering; signal representation; LDA; audio segmentation; characteristic regions; classification; cluster boundaries; dynamic programming; feature-set clustering; feature-set similarities; feature-space clustering; feature-space samples; linear discriminant analysis; signal representation; smart transport; song structure analysis; speech/music discrimination; thumbnails; training set; Algorithm design and analysis; Clustering algorithms; Cost function; Dynamic programming; Linear discriminant analysis; Robustness; Signal analysis; Signal processing; Speech analysis; Streaming media;
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics, 2003 IEEE Workshop on.
Print_ISBN :
0-7803-7850-4
DOI :
10.1109/ASPAA.2003.1285837