DocumentCode
1653911
Title
Kernel discriminant analysis for environmental sound recognition based on acoustic subspace
Author
Jiaxing Ye ; Kobayashi, Takehiko ; Murakawa, Masahiro ; Higuchi, Tatsuro
Author_Institution
Nat. Inst. of Adv. Ind. Sci. & Technol., Tsukuba, Japan
fYear
2013
Firstpage
808
Lastpage
812
Abstract
In this paper, we propose an effective discriminant subspace learning framework to recognize the environmental sounds. Firstly, Gabor transform is adopted to characterize the time-frequency distributions of environmental sounds. We further encode the prominent time-frequency patterns with low rank representation by extracting the subspace from Gabor spectrogram. Unlike conventional sound recognition schemes that are mostly based on acoustic feature vectors, we treat the acoustic subspaces (matrixes) as basic elements for recognition, retaining rich temporal-spectral contextual information. At recognition stage, we employ kernel Fisher discriminant analysis to effectively exploit the class conditional distributions of environmental sounds which are favorable for performing multi-class classification. With a well developed kernel function, the proposed approach achieved superior recognition performance on RWCP sound scene database, compared with the existing methods.
Keywords
audio signal processing; signal classification; transforms; Gabor spectrogram; Gabor transform; RWCP sound scene database; acoustic feature vectors; acoustic subspaces; conventional sound recognition schemes; discriminant subspace learning framework; environmental sound recognition; environmental sounds; kernel Fisher discriminant analysis; kernel discriminant analysis; kernel function; multiclass classification; rich temporal-spectral contextual information recognition; rich temporal-spectral contextual information retaining; time-frequency distributions; time-frequency patterns; Acoustics; Databases; Feature extraction; Kernel; Spectrogram; Time-frequency analysis; Transforms; Environmental sound; Gabor transform; RWCP Sound Scene Database; canonical angle; kernel fisher discriminant analysis; subspace learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6637760
Filename
6637760
Link To Document