Title :
A dynamic programming approach to audio segmentation and speech/music discrimination
Author :
Goodwin, Michael M. ; Laroche, Jean
Author_Institution :
Creative Adv. Technol. Center, Scotts Valley, CA, USA
Abstract :
We consider the problem of segmenting an audio signal into characteristic regions based on feature-set similarities. In the proposed approach, a feature-space representation of the signal is generated; sequences of these feature-space samples are then aggregated into clusters corresponding to distinct signal regions. The algorithm consists of using linear discriminant analysis (LDA) to condition the feature space and dynamic programming (DP) to identify data clusters. We consider the design of the dynamic program cost functions; we are able to derive effective cost functions without relying on significant prior information about the structure of the expected data clusters. We demonstrate the application of the LDA-DP segmentation algorithm to speech/music discrimination. Experimental results are given and discussed.
Keywords :
audio signal processing; dynamic programming; music; speech; speech processing; audio segmentation; audio signal segmentation; data clusters; dynamic program cost functions; dynamic programming; feature-space representation; linear discriminant analysis; signal representation; speech/music discrimination; Clustering algorithms; Cost function; Covariance matrix; Dynamic programming; Fingerprint recognition; Linear discriminant analysis; Multiple signal classification; Robustness; Signal generators; Speech;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326825