DocumentCode :
1382046
Title :
Sound Event Recognition With Probabilistic Distance SVMs
Author :
Tran, Huy Dat ; Li, Haizhou
Author_Institution :
Human Language Technol. Dept., A*STAR, Singapore, Singapore
Volume :
19
Issue :
6
fYear :
2011
Firstpage :
1556
Lastpage :
1568
Abstract :
Unlike other audio or speech signals, sound events have a relatively short time span. They are usually distinguished by their unique spectro-temporal signature. This paper proposes a novel classification method based on probabilistic distance support vector machines (SVMs). We study a parametric approach to characterizing sound signals using the distribution of the subband temporal envelope (STE), and kernel techniques for the subband probabilistic distance (SPD) under the framework of SVM. We show that generalized gamma modeling is well devised for sound characterization, and that the probabilistic distance kernel provides a closed form solution to the calculation of divergence distance, which tremendously reduces computational cost. We conducted experiments on a database of ten types of sound events. The results show that the proposed classification method significantly outperforms conventional SVM classifiers with Mel-frequency cepstral coefficients (MFCCs). The rapid computation of probabilistic distance also makes the proposed method an obvious choice for online sound event recognition.
Keywords :
audio signal processing; probability; speech recognition; support vector machines; MFCC; Mel-frequency cepstral coefficient; SPD; STE; audio signal; classification method; computational cost reduction; gamma modeling; kernel technique; probabilistic distance support vector machine; sound event recognition; speech signal; subband probabilistic distance SVM; subband temporal envelope; unique spectro-temporal signature; Kernel; Mel frequency cepstral coefficient; Probabilistic logic; Speech; Speech recognition; Support vector machines; Wavelet transforms; Divergence distance; probabilistic distance; sound characterization; sound event recognition; subband temporal envelope (STE); support vector machine (SVM);
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2010.2093519
Filename :
5639032
Link To Document :
بازگشت