• DocumentCode
    2279692
  • Title
    Time-varying noise compensation by sequential Monte Carlo method
  • Author
    Yao, Kaisheng ; Nakamura, Satoshi
  • Author_Institution
    ATR Spoken Language Translation Res. Labs., Kyoto, Japan
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    163
  • Lastpage
    166
  • Abstract
    We present a sequential Monte Carlo method for additive noise compensation, aimed at robust speech recognition in time-varying noise. At each frame, the method generates a set of samples that approximate the posterior distribution of the speech and noise parameters given the observation sequence up to the current frame. An explicit model of the effect of noise on speech features allows an extended Kalman filter to be constructed for each sample. Each filter produces an updated continuous state, which serves as the estimate of the noise parameter, and a prediction likelihood, which serves as the weight of the sample in minimum mean square error (MMSE) inference of the time-varying noise parameter over all samples. A selection step and a smoothing step are used to improve efficiency. In experiments, the method gave a significant performance improvement over noise compensation under a stationary noise assumption. It also performed better than the sequential EM algorithm in machine-gun noise.
  • Keywords
    Kalman filters; Monte Carlo methods; acoustic noise; inference mechanisms; interference suppression; least mean squares methods; parameter estimation; prediction theory; speech recognition; MMSE inference; extended Kalman filter; machine-gun noise; minimum mean square error inference; noise parameter estimation; prediction likelihood; robust speech recognition; sequential Monte Carlo method; time-varying noise compensation; Additive noise; Inference algorithms; Mean square error methods; Noise generators; Noise robustness; Predictive models; Smoothing methods; Speech enhancement; Speech recognition; State estimation
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
  • Print_ISBN
    0-7803-7343-X
  • Type
    conf
  • DOI
    10.1109/ASRU.2001.1034613
  • Filename
    1034613
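
  • Illustrative sketch (not part of the original record)
    The abstract describes a per-frame loop: draw samples, weight each by its prediction likelihood, take a weighted (MMSE) mean as the noise estimate, then apply a selection step. The Python below is a minimal sketch of that loop only, under loud assumptions: it is a plain bootstrap particle filter on a scalar random-walk noise level with a linear Gaussian observation, not the paper's per-sample extended Kalman filter, and it omits the smoothing step. The names track_noise_level, drift_var, and obs_var are hypothetical and chosen for illustration.

```python
import math
import random


def gaussian_pdf(x, mean, var):
    """Density of a scalar Gaussian N(mean, var) evaluated at x."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def systematic_resample(particles, weights, rng):
    """Selection step: draw len(particles) samples in proportion to weights."""
    n = len(particles)
    start = rng.random() / n
    out, cumulative, j = [], weights[0], 0
    for i in range(n):
        u = start + i / n
        # Advance to the first particle whose cumulative weight covers u.
        while u > cumulative and j < n - 1:
            j += 1
            cumulative += weights[j]
        out.append(particles[j])
    return out


def track_noise_level(observations, n_particles=200, drift_var=0.05,
                      obs_var=0.5, seed=0):
    """Return one MMSE estimate of the hidden noise level per frame.

    Assumed toy model (not from the paper): the level follows a random
    walk with variance drift_var, and each frame observes the level plus
    Gaussian error with variance obs_var.
    """
    rng = random.Random(seed)
    particles = [rng.gauss(0.0, 3.0) for _ in range(n_particles)]
    estimates = []
    for y in observations:
        # Prediction: propagate each sample through the random-walk dynamics.
        particles = [p + rng.gauss(0.0, math.sqrt(drift_var)) for p in particles]
        # Weighting: likelihood of the current frame under each sample.
        weights = [gaussian_pdf(y, p, obs_var) for p in particles]
        total = sum(weights)
        if total == 0.0:
            # All likelihoods underflowed; fall back to uniform weights.
            weights = [1.0 / n_particles] * n_particles
        else:
            weights = [w / total for w in weights]
        # MMSE inference: weighted mean over the samples.
        estimates.append(sum(w * p for w, p in zip(weights, particles)))
        # Selection: duplicate high-weight samples, drop low-weight ones.
        particles = systematic_resample(particles, weights, rng)
    return estimates
```

    For example, calling track_noise_level([0.0] * 50 + [5.0] * 50) feeds the filter a level that jumps abruptly halfway through; the per-frame estimates should follow the jump within a few frames, which is the kind of time-varying behavior a stationary noise assumption cannot capture.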