مرکز منطقه ای اطلاع رساني علوم و فناوري - Exploiting principal component analysis in modulation spectrum enhancement for robust speech recognition

DocumentCode :

553217

Title :

Exploiting principal component analysis in modulation spectrum enhancement for robust speech recognition

Author :

Jan-Yee Lee ; Jeih-weih Hung

Author_Institution :

Dept. of Environ. Eng., Kun Shan Univ., Tainan, Taiwan

Volume :

fYear :

2011

fDate :

26-28 July 2011

Firstpage :

1947

Lastpage :

1951

Abstract :

In this paper, we present a novel method to improve the noise robustness of speech features based on principal component analysis (PCA). The PCA process is employed to extract a set of basis spectral vectors for the modulation spectra of clean training speech features. The new modulation spectra of the speech features, constructed by mapping the original modulation spectra into the space spanned by these PCA-derived basis vectors, have shown robustness against the noise distortion. The experiments conducted on the Aurora-2 digit string database revealed that the proposed PCA-based approach, together with mean and variance normalization (MVN), can provide average error reduction rates of over 65% and 12% relative as compared with the baseline MFCC system and that using the MVN method alone, respectively.

Keywords :

audio databases; principal component analysis; speech recognition; Aurora-2 digit string database; MVN method; PCA process; baseline MFCC system; basis spectral vectors; clean training speech features; error reduction; mean and variance normalization; modulation spectra; modulation spectra mapping; noise robustness; principal component analysis; robust speech recognition; Accuracy; Mel frequency cepstral coefficient; Modulation; Principal component analysis; Speech; Speech recognition; Training; modulation spectrum; principal component analysis; robust speech recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on

Conference_Location :

Shanghai

Print_ISBN :

978-1-61284-180-9

Type :

conf

DOI :

10.1109/FSKD.2011.6019893

Filename :

6019893

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=553217