مرکز منطقه ای اطلاع رساني علوم و فناوري - Phase-Based Dual-Microphone Speech Enhancement Using A Prior Speech Model

DocumentCode :

865887

Title :

Phase-Based Dual-Microphone Speech Enhancement Using A Prior Speech Model

Author :

Shi, Guangji ; Aarabi, Parham ; Jiang, Hui

Author_Institution :

Dept. of Electr. & Comput. Eng., Univ. of Toronto, Ont.

Volume :

Issue :

fYear :

2007

Firstpage :

109

Lastpage :

118

Abstract :

This paper proposes a phase-based dual-microphone speech enhancement technique that utilizes a prior speech model. Recently, it has been shown that phase-based dual-microphone filters can result in significant noise reduction in low signal-to-noise ratio [(SNR) less than 10 dB] conditions and negligible distortion at high SNRs (greater than 10 dB), as long as a correct filter parameter is chosen at each SNR. While prior work utilizes a constant parameter for all SNRs, we present an SNR-adaptive filter parameter estimation algorithm that maximizes the likelihood of the enhanced speech features based on a prior speech model. Experimental results using the CARVUI database show significant speech recognition accuracy rate improvement over alternative techniques in low SNR situations (e.g., an improvement of 11% in word error rate (WER) over postfiltering and 23% over delay-and-sum beamforming at 0 dB) and negligible distortion at high SNRs. The proposed adaptive approach also significantly outperforms the original phase-based filter with a constant parameter. Furthermore, it improves the filter´s robustness when there are errors in time delay estimation

Keywords :

adaptive filters; microphones; speech enhancement; speech recognition; SNR-adaptive filter; a prior speech model; delay-and-sum beamforming; low signal-to-noise ratio; parameter estimation algorithm; phase-based dual-microphone speech enhancement; speech recognition; time delay estimation; word error rate; Delay; Error analysis; Filters; Noise reduction; Parameter estimation; Phase distortion; Signal to noise ratio; Spatial databases; Speech enhancement; Speech recognition; Microphone array; phase-error filtering; robust speech recognition; speech enhancement; time-frequency masking;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2006.876870

Filename :

4032794

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=865887