Neural network filters for speech enhancement

Author

Knecht, Wolfgang G. ; Schenkel, Markus E. ; Moschytz, George S.

Author_Institution

Swiss Federal Inst. of Technol., Zurich, Switzerland

Volume

3

Issue

6

fYear

1995

fDate

11/1/1995 12:00:00 AM

Firstpage

433

Lastpage

438

Abstract

In adaptive noise cancelling, linear digital filters have been used to minimize the mean squared difference between filter outputs and the desired signal. However, for non-Gaussian probability density functions of the involved signals, nonlinear filters can further reduce the mean squared difference, thereby improving the signal-to-noise ratio at the system output. This is illustrated with a two-microphone beamformer for cancelling directional interference. In the case of a single uniformly distributed interference, we establish the optimum nonlinear performance limit. To approximate optimum performance, we realize two nonlinear filter architectures, the Volterra filter and the multilayer perceptron. The Volterra filter is also examined for speech interference. The beamformer is adapted to minimize the mean squared difference, but performance is measured with the intelligibility weighted gain. This criterion requires the signal-to-noise ratio at the beamformer output. For the nonlinear processor, this can only be determined when no target components exist in the reference channel of the noise canceller so that the target is transmitted without distortion. Under these ideal conditions and at equal filter lengths, the quadratic Volterra filter improves the intelligibility-weighted gain by maximally 2 dB relative to the linear filter

Keywords

acoustic signal processing; adaptive filters; adaptive signal processing; digital filters; filtering theory; interference suppression; microphones; multilayer perceptrons; nonlinear filters; probability; speech enhancement; speech intelligibility; adaptive noise cancellation; beamformer output; directional interference cancellation; filter lengths; filter outputs; intelligibility weighted gain; linear digital filters; mean squared difference; multilayer perceptron; neural network filters; nonGaussian probability density functions; nonlinear filters; nonlinear processor; optimum nonlinear performance limit; quadratic Volterra filter; reference channel; signal-to-noise ratio; speech enhancement; speech interference; two-microphone beamformer; uniformly distributed interference; Adaptive filters; Digital filters; Interference cancellation; Multilayer perceptrons; Neural networks; Noise cancellation; Nonlinear filters; Probability density function; Signal to noise ratio; Speech enhancement;

fLanguage

English

Journal_Title

Speech and Audio Processing, IEEE Transactions on

Publisher

ieee

ISSN

1063-6676

Type

jour

DOI

10.1109/89.482210

Filename

482210