DocumentCode :
3530449
Title :
Robust speech feature extraction based on Gabor filtering and tensor factorization
Author :
Wu, Qiang ; Zhang, Liqing ; Shi, Guangchuan
Author_Institution :
Dept. of Comput. Sci., Shanghai Jiao Tong Univ., Shanghai
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
4649
Lastpage :
4652
Abstract :
In this paper, we investigate the speech feature extraction problem in the noisy environment. A novel approach based on Gabor filtering and tensor factorization is proposed. From recent physiological and psychoacoustic experimental results, localized spectro-temporal features are essential for auditory perception. We employ 2D-Gabor functions with different scales and directions to analyze the localized patches of power spectrogram, by which speech signal can be encoded as a general higher order tensor. Then nonnegative tensor PCA with sparse constraint is used to learn the projection matrices from multiple interrelated feature subspaces and extract the robust features. Experimental results confirm that our proposed method can improve the speech recognition performance, especially in noisy environment, compared with traditional speech feature extraction methods.
Keywords :
Gabor filters; feature extraction; hearing; matrix decomposition; principal component analysis; speech recognition; tensors; Gabor filtering; auditory perception; localized spectro-temporal feature; multiple interrelated feature subspace; nonnegative tensor PCA; power spectrogram; projection matrix; robust speech feature extraction; tensor factorization; Feature extraction; Filtering; Gabor filters; Psychology; Robustness; Signal analysis; Spectrogram; Speech analysis; Tensile stress; Working environment noise; acoustic noise; auditory perception; feature extraction; gabor filtering; speech recognition; tensor factorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960667
Filename :
4960667
Link To Document :
بازگشت