DocumentCode :
1653911
Title :
Kernel discriminant analysis for environmental sound recognition based on acoustic subspace
Author :
Jiaxing Ye ; Kobayashi, Takehiko ; Murakawa, Masahiro ; Higuchi, Tatsuro
Author_Institution :
Nat. Inst. of Adv. Ind. Sci. & Technol., Tsukuba, Japan
fYear :
2013
Firstpage :
808
Lastpage :
812
Abstract :
In this paper, we propose an effective discriminant subspace learning framework to recognize the environmental sounds. Firstly, Gabor transform is adopted to characterize the time-frequency distributions of environmental sounds. We further encode the prominent time-frequency patterns with low rank representation by extracting the subspace from Gabor spectrogram. Unlike conventional sound recognition schemes that are mostly based on acoustic feature vectors, we treat the acoustic subspaces (matrixes) as basic elements for recognition, retaining rich temporal-spectral contextual information. At recognition stage, we employ kernel Fisher discriminant analysis to effectively exploit the class conditional distributions of environmental sounds which are favorable for performing multi-class classification. With a well developed kernel function, the proposed approach achieved superior recognition performance on RWCP sound scene database, compared with the existing methods.
Keywords :
audio signal processing; signal classification; transforms; Gabor spectrogram; Gabor transform; RWCP sound scene database; acoustic feature vectors; acoustic subspaces; conventional sound recognition schemes; discriminant subspace learning framework; environmental sound recognition; environmental sounds; kernel Fisher discriminant analysis; kernel discriminant analysis; kernel function; multiclass classification; rich temporal-spectral contextual information recognition; rich temporal-spectral contextual information retaining; time-frequency distributions; time-frequency patterns; Acoustics; Databases; Feature extraction; Kernel; Spectrogram; Time-frequency analysis; Transforms; Environmental sound; Gabor transform; RWCP Sound Scene Database; canonical angle; kernel fisher discriminant analysis; subspace learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6637760
Filename :
6637760
Link To Document :
بازگشت