Title :
Exploiting principal component analysis in modulation spectrum enhancement for robust speech recognition
Author :
Jan-Yee Lee ; Jeih-weih Hung
Author_Institution :
Dept. of Environ. Eng., Kun Shan Univ., Tainan, Taiwan
Abstract :
In this paper, we present a novel method to improve the noise robustness of speech features based on principal component analysis (PCA). The PCA process is employed to extract a set of basis spectral vectors for the modulation spectra of clean training speech features. The new modulation spectra of the speech features, constructed by mapping the original modulation spectra into the space spanned by these PCA-derived basis vectors, have shown robustness against the noise distortion. The experiments conducted on the Aurora-2 digit string database revealed that the proposed PCA-based approach, together with mean and variance normalization (MVN), can provide average error reduction rates of over 65% and 12% relative as compared with the baseline MFCC system and that using the MVN method alone, respectively.
Keywords :
audio databases; principal component analysis; speech recognition; Aurora-2 digit string database; MVN method; PCA process; baseline MFCC system; basis spectral vectors; clean training speech features; error reduction; mean and variance normalization; modulation spectra; modulation spectra mapping; noise robustness; principal component analysis; robust speech recognition; Accuracy; Mel frequency cepstral coefficient; Modulation; Principal component analysis; Speech; Speech recognition; Training; modulation spectrum; principal component analysis; robust speech recognition;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-61284-180-9
DOI :
10.1109/FSKD.2011.6019893