DocumentCode :
3342256
Title :
A dynamic programming approach to audio segmentation and speech/music discrimination
Author :
Goodwin, Michael M. ; Laroche, Jean
Author_Institution :
Creative Adv. Technol. Center, Scotts Valley, CA, USA
Volume :
4
fYear :
2004
fDate :
17-21 May 2004
Abstract :
We consider the problem of segmenting an audio signal into characteristic regions based on feature-set similarities. In the proposed approach, a feature-space representation of the signal is generated; sequences of these feature-space samples are then aggregated into clusters corresponding to distinct signal regions. The algorithm consists of using linear discriminant analysis (LDA) to condition the feature space and dynamic programming (DP) to identify data clusters. We consider the design of the dynamic program cost functions; we are able to derive effective cost functions without relying on significant prior information about the structure of the expected data clusters. We demonstrate the application of the LDA-DP segmentation algorithm to speech/music discrimination. Experimental results are given and discussed.
Keywords :
audio signal processing; dynamic programming; music; speech; speech processing; audio segmentation; audio signal segmentation; data clusters; dynamic program cost functions; dynamic programming; feature-space representation; linear discriminant analysis; signal representation; speech/music discrimination; Clustering algorithms; Cost function; Covariance matrix; Dynamic programming; Fingerprint recognition; Linear discriminant analysis; Multiple signal classification; Robustness; Signal generators; Speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1326825
Filename :
1326825
Link To Document :
بازگشت