Time-varying noise compensation by sequential Monte Carlo method

Author

Yao, Kaisheng ; Nakamura, Satoshi

Author_Institution

ATR Spoken Language Translation Res. Labs., Kyoto, Japan

fYear

2001

fDate

2001

Firstpage

163

Lastpage

166

Abstract

We present a sequential Monte Carlo method applied to additive noise compensation for robust speech recognition in time-varying noise. At each frame, the method generates a set of samples, approximating the posterior distribution of speech and noise parameters for given observation sequences to the current frame. An explicit model representing noise effects on speech features is used, so that an extended Kalman filter is constructed for each sample, generating an updated continuous state as the estimation of the noise parameter, and prediction likelihood as the weight of each sample for minimum mean square error inference of the time-varying noise parameter over these samples. A selection step and a smoothing step are used to improve efficiency. Through experiments, we observed significant performance improvement over that achieved by noise compensation with a stationary noise assumption. It also performed better than the sequential EM algorithm in machine-gun noise.

Keywords

Kalman filters; Monte Carlo methods; acoustic noise; inference mechanisms; interference suppression; least mean squares methods; parameter estimation; prediction theory; speech recognition; MMSE inference; extended Kalman filter; machine-gun noise; minimum mean square error inference; noise parameter estimation; prediction likelihood; robust speech recognition; sequential Monte Carlo method; time-varying noise compensation; Additive noise; Inference algorithms; Mean square error methods; Noise generators; Noise robustness; Predictive models; Smoothing methods; Speech enhancement; Speech recognition; State estimation;

fLanguage

English

Publisher

ieee

Conference_Titel

Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on

Print_ISBN

0-7803-7343-X

Type

conf

DOI

10.1109/ASRU.2001.1034613

Filename

1034613