DocumentCode :
846254
Title :
Microphone array post-filter based on noise field coherence
Author :
McCowan, Iain A. ; Bourlard, Hervé
Author_Institution :
Dalle Molle Inst. for Perceptual Artificial Intelligence, Martigny, Switzerland
Volume :
11
Issue :
6
fYear :
2003
Firstpage :
709
Lastpage :
716
Abstract :
This paper introduces a novel technique for estimating the signal power spectral density to be used in the transfer function of a microphone array post-filter. The technique is a generalization of the existing Zelinski post-filter, which uses the auto- and cross-spectral densities of the array inputs to estimate the signal and noise spectral densities. The Zelinski technique, however, assumes zero cross-correlation between the noise on different sensors. This assumption is inaccurate, particularly at low frequencies and for arrays with closely spaced sensors, and thus the corresponding post-filter is suboptimal in realistic noise conditions. In this paper, a more general expression of the post-filter estimation is developed based on an assumed knowledge of the complex coherence of the noise field. This general expression can be used to construct a more appropriate post-filter in a variety of different noise fields. In experiments using real noise recordings from a computer office, the modified post-filter results in significant improvement in terms of objective speech quality measures and speech recognition performance using a diffuse noise model.
Keywords :
acoustic signal processing; acoustic transducer arrays; array signal processing; digital filters; microphones; spectral analysis; speech recognition; computer office noise; diffuse noise model; microphone array; noise field coherence; noise field complex coherence; objective speech quality; post-filter estimation; speech recognition; Array signal processing; Coherence; Frequency; Genetic expression; Low-frequency noise; Microphone arrays; Sensor arrays; Speech enhancement; Speech recognition; Transfer functions;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2003.818212
Filename :
1255457
Link To Document :
بازگشت