DocumentCode :
706075
Title :
Speech event detection by non negative matrix deconvolution
Author :
Lopes, Carla ; Perdigao, Fernando
Author_Institution :
Inst. de Telecomun., Polo II, Coimbra, Portugal
fYear :
2007
fDate :
3-7 Sept. 2007
Firstpage :
1280
Lastpage :
1284
Abstract :
Support Vector Machines (SVM) are applied to the problem of detecting and classifying broad acoustic-phonetic classes (events). In this paper an approach based on Non-Negative Matrix Deconvolution (NMD) is proposed to merge frame-based SVM predictions into segmental events. To turn the SVM outputs, which are frame-based, into a signal segmented in terms of events, two different event merger methods were tested and the results, using TIMIT speech data, were compared to a broad class detector, built using HMMs with an MFCC front-end. Results show that NMD efficiently controls the number of insertion and deletion errors and outperforms HMM´s accuracy. The quality of the event segmenter was measured by means of a recently proposed methodology to evaluate event detectors performance and the results show that the proposed approach also outperforms the competing ones.
Keywords :
acoustic signal processing; cepstral analysis; deconvolution; hidden Markov models; signal classification; speech recognition; support vector machines; HMM; MFCC front-end; Mel-frequency cepstral coefficients; NMD; TIMIT speech data; acoustic-phonetic classes; class detector; event detectors; event merger methods; event segmenter; frame-based SVM predictions; hidden Markov models; nonnegative matrix deconvolution; segmental events; speech event detection; support vector machines; Accuracy; Acoustics; Corporate acquisitions; Hidden Markov models; Speech; Support vector machines; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2007 15th European
Conference_Location :
Poznan
Print_ISBN :
978-839-2134-04-6
Type :
conf
Filename :
7099011
Link To Document :
بازگشت