DocumentCode
1350250
Title
On the temporal decorrelation of feature parameters for noise-robust speech recognition
Author
Jung, Ho-Young ; Lee, Soo-Young
Author_Institution
Dept. of Electr. Eng., Korea Adv. Inst. of Sci. & Technol., Seoul, South Korea
Volume
8
Issue
4
fYear
2000
fDate
7/1/2000 12:00:00 AM
Firstpage
407
Lastpage
416
Abstract
We propose a new frame decorrelation method for robust speech recognition in noisy environments. In most cases, signal perturbation is caused by channel distortion and additive background noise, and can be modeled as a slowly varying term in either the log spectral or the linear-spectral domains. Thus, it is effective to deemphasize slowly varying stationary components in the spectral feature domain of speech signals, which can be considered as a temporal decorrelation process. The proposed method presents a well structured high-pass filter using the decorrelation principle, and provides some significant insights into existing high-pass approaches, such as relative spectral (RASTA) processing. The performance of the proposed method was evaluated by speaker-independent isolated-word recognition experiments using the hidden Markov model (HMM). Noisy speech was simulated by adding noise sources taken from the Noisex-92 database. Experimental results showed that the proposed method was effective for speech recognition with significant noise and yielded better performance than other high-pass methods. In addition, we compared the dynamic property of the proposed filter with that of delta features. The feature obtained by the proposed method may offer most of the delta feature property
Keywords
decorrelation; feature extraction; filtering theory; hidden Markov models; high-pass filters; noise; parameter estimation; spectral analysis; speech recognition; HMM; Noisex-92 database; RASTA processing; additive background noise; channel distortion; delta features; dynamic property; experimental results; feature parameters; frame decorrelation method; hidden Markov model; high-pass filter; linear-spectral domain; log spectral domain; noise sources; noise-robust speech recognition; noisy environments; noisy speech simulation; performance evaluation; relative spectral processing; signal perturbation; slowly varying stationary components; speaker-independent isolated-word recognition; spectral feature domain; speech signals; temporal decorrelation; Additive noise; Background noise; Decorrelation; Distortion; Filters; Hidden Markov models; Noise robustness; Speech processing; Speech recognition; Working environment noise;
fLanguage
English
Journal_Title
Speech and Audio Processing, IEEE Transactions on
Publisher
ieee
ISSN
1063-6676
Type
jour
DOI
10.1109/89.848222
Filename
848222
Link To Document