• DocumentCode
    1350250
  • Title

    On the temporal decorrelation of feature parameters for noise-robust speech recognition

  • Author

    Jung, Ho-Young ; Lee, Soo-Young

  • Author_Institution
    Dept. of Electr. Eng., Korea Adv. Inst. of Sci. & Technol., Seoul, South Korea
  • Volume
    8
  • Issue
    4
  • fYear
    2000
  • fDate
    7/1/2000 12:00:00 AM
  • Firstpage
    407
  • Lastpage
    416
  • Abstract
    We propose a new frame decorrelation method for robust speech recognition in noisy environments. In most cases, signal perturbation is caused by channel distortion and additive background noise, and can be modeled as a slowly varying term in either the log spectral or the linear-spectral domains. Thus, it is effective to deemphasize slowly varying stationary components in the spectral feature domain of speech signals, which can be considered as a temporal decorrelation process. The proposed method presents a well structured high-pass filter using the decorrelation principle, and provides some significant insights into existing high-pass approaches, such as relative spectral (RASTA) processing. The performance of the proposed method was evaluated by speaker-independent isolated-word recognition experiments using the hidden Markov model (HMM). Noisy speech was simulated by adding noise sources taken from the Noisex-92 database. Experimental results showed that the proposed method was effective for speech recognition with significant noise and yielded better performance than other high-pass methods. In addition, we compared the dynamic property of the proposed filter with that of delta features. The feature obtained by the proposed method may offer most of the delta feature property
  • Keywords
    decorrelation; feature extraction; filtering theory; hidden Markov models; high-pass filters; noise; parameter estimation; spectral analysis; speech recognition; HMM; Noisex-92 database; RASTA processing; additive background noise; channel distortion; delta features; dynamic property; experimental results; feature parameters; frame decorrelation method; hidden Markov model; high-pass filter; linear-spectral domain; log spectral domain; noise sources; noise-robust speech recognition; noisy environments; noisy speech simulation; performance evaluation; relative spectral processing; signal perturbation; slowly varying stationary components; speaker-independent isolated-word recognition; spectral feature domain; speech signals; temporal decorrelation; Additive noise; Background noise; Decorrelation; Distortion; Filters; Hidden Markov models; Noise robustness; Speech processing; Speech recognition; Working environment noise;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.848222
  • Filename
    848222