• DocumentCode
    846254
  • Title

    Microphone array post-filter based on noise field coherence

  • Author

    McCowan, Iain A. ; Bourlard, Hervé

  • Author_Institution
    Dalle Molle Inst. for Perceptual Artificial Intelligence, Martigny, Switzerland
  • Volume
    11
  • Issue
    6
  • fYear
    2003
  • Firstpage
    709
  • Lastpage
    716
  • Abstract
    This paper introduces a novel technique for estimating the signal power spectral density to be used in the transfer function of a microphone array post-filter. The technique is a generalization of the existing Zelinski post-filter, which uses the auto- and cross-spectral densities of the array inputs to estimate the signal and noise spectral densities. The Zelinski technique, however, assumes zero cross-correlation between the noise on different sensors. This assumption is inaccurate, particularly at low frequencies and for arrays with closely spaced sensors, and thus the corresponding post-filter is suboptimal in realistic noise conditions. In this paper, a more general expression of the post-filter estimation is developed based on an assumed knowledge of the complex coherence of the noise field. This general expression can be used to construct a more appropriate post-filter in a variety of different noise fields. In experiments using real noise recordings from a computer office, the modified post-filter results in significant improvement in terms of objective speech quality measures and speech recognition performance using a diffuse noise model.
  • Keywords
    acoustic signal processing; acoustic transducer arrays; array signal processing; digital filters; microphones; spectral analysis; speech recognition; computer office noise; diffuse noise model; microphone array; noise field coherence; noise field complex coherence; objective speech quality; post-filter estimation; speech recognition; Array signal processing; Coherence; Frequency; Genetic expression; Low-frequency noise; Microphone arrays; Sensor arrays; Speech enhancement; Speech recognition; Transfer functions;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/TSA.2003.818212
  • Filename
    1255457