• DocumentCode
    1246596
  • Title

    Neural network filters for speech enhancement

  • Author

    Knecht, Wolfgang G. ; Schenkel, Markus E. ; Moschytz, George S.

  • Author_Institution
    Swiss Federal Inst. of Technol., Zurich, Switzerland
  • Volume
    3
  • Issue
    6
  • fYear
    1995
  • fDate
    11/1/1995 12:00:00 AM
  • Firstpage
    433
  • Lastpage
    438
  • Abstract
    In adaptive noise cancelling, linear digital filters have been used to minimize the mean squared difference between filter outputs and the desired signal. However, when the involved signals have non-Gaussian probability density functions, nonlinear filters can further reduce the mean squared difference, thereby improving the signal-to-noise ratio at the system output. This is illustrated with a two-microphone beamformer for cancelling directional interference. In the case of a single uniformly distributed interference, we establish the optimum nonlinear performance limit. To approximate optimum performance, we realize two nonlinear filter architectures, the Volterra filter and the multilayer perceptron. The Volterra filter is also examined for speech interference. The beamformer is adapted to minimize the mean squared difference, but performance is measured with the intelligibility-weighted gain. This criterion requires the signal-to-noise ratio at the beamformer output. For the nonlinear processor, this can only be determined when no target components exist in the reference channel of the noise canceller, so that the target is transmitted without distortion. Under these ideal conditions and at equal filter lengths, the quadratic Volterra filter improves the intelligibility-weighted gain by at most 2 dB relative to the linear filter.
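    The comparison between a linear filter and a quadratic Volterra filter can be illustrated with a toy adaptive noise canceller. This is a minimal sketch under stated assumptions, not the paper's method: it uses a single reference channel with no target leakage instead of the two-microphone beamformer, an LMS update, and the residual mean squared error instead of the intelligibility-weighted gain. The interference path, tap count, step size, and all function names are hypothetical choices made for illustration.

```python
import random


def volterra_features(window, quadratic):
    """Return the linear taps, plus all products x[i]*x[j] (i <= j) if quadratic."""
    feats = list(window)
    if quadratic:
        n = len(window)
        for i in range(n):
            for j in range(i, n):
                feats.append(window[i] * window[j])
    return feats


def lms_cancel(primary, reference, taps=3, mu=0.02, quadratic=False):
    """LMS noise canceller: estimate the interference in `primary` from
    `reference` and return the error signal (the canceller output)."""
    n_feats = taps + (taps * (taps + 1) // 2 if quadratic else 0)
    weights = [0.0] * n_feats
    window = [0.0] * taps
    output = []
    for d, x in zip(primary, reference):
        window = [x] + window[:-1]  # shift the newest reference sample in
        feats = volterra_features(window, quadratic)
        estimate = sum(w * f for w, f in zip(weights, feats))
        err = d - estimate  # minimizing E[err^2] is the adaptation criterion
        weights = [w + mu * err * f for w, f in zip(weights, feats)]
        output.append(err)
    return output


def demo(n=10000, seed=0):
    """Compare residual interference power for linear vs. quadratic filters."""
    rng = random.Random(seed)
    # Uniformly distributed (non-Gaussian) interference source.
    reference = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    # Stand-in for the target signal (assumption: low-level Gaussian noise).
    target = [0.1 * rng.gauss(0.0, 1.0) for _ in range(n)]
    primary = []
    for k in range(n):
        x0 = reference[k]
        x1 = reference[k - 1] if k >= 1 else 0.0
        # Hypothetical interference path with a quadratic cross term.
        primary.append(target[k] + 0.8 * x0 - 0.4 * x1 + 0.5 * x0 * x1)
    results = {}
    for name, quad in (("linear", False), ("volterra", True)):
        out = lms_cancel(primary, reference, quadratic=quad)
        half = n // 2  # skip the adaptation transient
        residual = [o - t for o, t in zip(out[half:], target[half:])]
        results[name] = sum(r * r for r in residual) / len(residual)
    return results


if __name__ == "__main__":
    for name, mse in demo().items():
        print(f"{name}: residual interference MSE = {mse:.5f}")
```

    In this toy setup the linear filter cannot model the cross term, so its residual interference stays higher than that of the quadratic filter. The size of the gap depends entirely on the assumed interference path and says nothing about the 2 dB figure reported in the abstract.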
  • Keywords
    acoustic signal processing; adaptive filters; adaptive signal processing; digital filters; filtering theory; interference suppression; microphones; multilayer perceptrons; nonlinear filters; probability; speech enhancement; speech intelligibility; adaptive noise cancellation; beamformer output; directional interference cancellation; filter lengths; filter outputs; intelligibility weighted gain; linear digital filters; mean squared difference; multilayer perceptron; neural network filters; nonGaussian probability density functions; nonlinear filters; nonlinear processor; optimum nonlinear performance limit; quadratic Volterra filter; reference channel; signal-to-noise ratio; speech enhancement; speech interference; two-microphone beamformer; uniformly distributed interference; Adaptive filters; Digital filters; Interference cancellation; Multilayer perceptrons; Neural networks; Noise cancellation; Nonlinear filters; Probability density function; Signal to noise ratio; Speech enhancement;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.482210
  • Filename
    482210