Title :
Phase-Based Dual-Microphone Speech Enhancement Using A Prior Speech Model
Author :
Shi, Guangji ; Aarabi, Parham ; Jiang, Hui
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Toronto, Ont.
Abstract :
This paper proposes a phase-based dual-microphone speech enhancement technique that utilizes a prior speech model. Recently, it has been shown that phase-based dual-microphone filters can result in significant noise reduction in low signal-to-noise ratio [(SNR) less than 10 dB] conditions and negligible distortion at high SNRs (greater than 10 dB), as long as a correct filter parameter is chosen at each SNR. While prior work utilizes a constant parameter for all SNRs, we present an SNR-adaptive filter parameter estimation algorithm that maximizes the likelihood of the enhanced speech features based on a prior speech model. Experimental results using the CARVUI database show significant speech recognition accuracy rate improvement over alternative techniques in low SNR situations (e.g., an improvement of 11% in word error rate (WER) over postfiltering and 23% over delay-and-sum beamforming at 0 dB) and negligible distortion at high SNRs. The proposed adaptive approach also significantly outperforms the original phase-based filter with a constant parameter. Furthermore, it improves the filter´s robustness when there are errors in time delay estimation
Keywords :
adaptive filters; microphones; speech enhancement; speech recognition; SNR-adaptive filter; a prior speech model; delay-and-sum beamforming; low signal-to-noise ratio; parameter estimation algorithm; phase-based dual-microphone speech enhancement; speech recognition; time delay estimation; word error rate; Delay; Error analysis; Filters; Noise reduction; Parameter estimation; Phase distortion; Signal to noise ratio; Spatial databases; Speech enhancement; Speech recognition; Microphone array; phase-error filtering; robust speech recognition; speech enhancement; time-frequency masking;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2006.876870