• DocumentCode
    2654691
  • Title

    Integrating Kalman filtering and multi-pulse coding for speech enhancement with a non-stationary model of the speech signal

  • Author

    Li, Chunjian ; Andersen, Søren Vang

  • Author_Institution
    Dept. of Commun. Technol., Aalborg Univ., Denmark
  • Volume
    2
  • fYear
    2004
  • fDate
    7-10 Nov. 2004
  • Firstpage
    2300
  • Abstract
    In this paper, speech enhancement via Kalman filtering is considered. A non-stationary signal model for the speech signal is first described. This model consists of a slowly varying AR model and an excitation source that exhibits a rapidly time-varying variance. The AR model and the excitation model fit nicely into the Kalman filtering framework, fully exploiting the capability of the Kalman filter to process non-stationary signals in an LMMSE optimum manner. The AR-model coefficients are estimated by a decision-directed type power spectral subtraction method followed by an LPC analysis. For the robust estimation of the rapidly time-varying excitation model in the presence of noise, we propose the use of a multi-pulse linear predictive coding (MPLPC) based method. The Kalman filtering algorithm based on the non-stationary signal model is able to partially avoid the commonly used quasi-stationarity assumption of the speech. Therefore the non-stationarity of the signal is fully exploited in suppressing the noise power that is more stationary. Our experiments show that the Kalman filter with rapidly time-varying variance modeling using the proposed MPLPC based method brings significant performance improvement both when compared to a baseline Kalman filtering method with quasi-stationarity assumption and when compared to the well-known MMSE log-spectral amplitude estimator (MMSE-LSA).
  • Keywords
    Kalman filters; amplitude estimation; filtering theory; least mean squares methods; linear codes; signal denoising; spectral analysis; speech coding; speech enhancement; Kalman filtering; LMMSE optimum manner; excitation source; log-spectral amplitude estimator; multipulse linear predictive coding; noise power suppression; power spectral subtraction method; robust estimation; speech enhancement; speech quasi-stationarity assumption; speech signal nonstationary model; time-varying excitation model; time-varying variance; Filtering; Kalman filters; Linear predictive coding; Noise robustness; Signal processing; Signal resolution; Speech analysis; Speech coding; Speech enhancement; Wiener filter;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signals, Systems and Computers, 2004. Conference Record of the Thirty-Eighth Asilomar Conference on
  • Print_ISBN
    0-7803-8622-1
  • Type

    conf

  • DOI
    10.1109/ACSSC.2004.1399578
  • Filename
    1399578