• DocumentCode
    394274
  • Title

    Sequential MAP estimation based speech feature enhancement for noise robust speech recognition

  • Author

    Jia, Chuan ; Ding, Peng ; Xu, Bo

  • Author_Institution
    Inst. of Autom., Acad. Sinica, Beijing, China
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    In this paper, the environment mismatch due to additive noise is assumed as an additive bias in power spectral domain. It is viable to introduce some constraints on the values that the bias can take due to the internal relation between bias and noise power spectrum. We propose to introduce the noise priori knowledge into bias estimation process by using maximum a posteriori (MAP) criterion. Moreover, the mismatch is usually nonstationary in real application and sequential algorithm can be used to track time varying environment within a test utterance. This paper proposes to use the sequential techniques to estimate the bias in the MAP framework and update the parameters of noise priori adaptively. Speech recognition experiments demonstrated that the proposed algorithm outperformed sequential ML estimation method and was obviously better than the batch mode under non-stationary noise environment.
  • Keywords
    feature extraction; maximum likelihood estimation; sequential estimation; spectral analysis; speech enhancement; speech recognition; white noise; additive noise; bias estimation; environment mismatch; maximum a posteriori criterion; noise power spectrum; noise robust speech recognition; nonstationary noise environment; power spectral domain; sequential MAP estimation; sequential ML estimation method; sequential algorithm; speech feature enhancement; test utterance; time varying environment tracking; white noise source; Additive noise; Automatic speech recognition; Maximum likelihood estimation; Noise robustness; Parameter estimation; Speech enhancement; Speech recognition; Stochastic resonance; Testing; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198805
  • Filename
    1198805