DocumentCode :
553217
Title :
Exploiting principal component analysis in modulation spectrum enhancement for robust speech recognition
Author :
Jan-Yee Lee ; Jeih-weih Hung
Author_Institution :
Dept. of Environ. Eng., Kun Shan Univ., Tainan, Taiwan
Volume :
3
fYear :
2011
fDate :
26-28 July 2011
Firstpage :
1947
Lastpage :
1951
Abstract :
In this paper, we present a novel method to improve the noise robustness of speech features based on principal component analysis (PCA). The PCA process is employed to extract a set of basis spectral vectors for the modulation spectra of clean training speech features. The new modulation spectra of the speech features, constructed by mapping the original modulation spectra into the space spanned by these PCA-derived basis vectors, have shown robustness against the noise distortion. The experiments conducted on the Aurora-2 digit string database revealed that the proposed PCA-based approach, together with mean and variance normalization (MVN), can provide average error reduction rates of over 65% and 12% relative as compared with the baseline MFCC system and that using the MVN method alone, respectively.
Keywords :
audio databases; principal component analysis; speech recognition; Aurora-2 digit string database; MVN method; PCA process; baseline MFCC system; basis spectral vectors; clean training speech features; error reduction; mean and variance normalization; modulation spectra; modulation spectra mapping; noise robustness; principal component analysis; robust speech recognition; Accuracy; Mel frequency cepstral coefficient; Modulation; Principal component analysis; Speech; Speech recognition; Training; modulation spectrum; principal component analysis; robust speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-61284-180-9
Type :
conf
DOI :
10.1109/FSKD.2011.6019893
Filename :
6019893
Link To Document :
بازگشت