Regional Information Center for Science and Technology - Phase-based dual-microphone robust speech enhancement

DocumentCode :

1036504

Title :

Phase-based dual-microphone robust speech enhancement

Author :

Aarabi, Parham ; Shi, Guangji

Author_Institution :

Dept. of Electr. & Comput. Eng., Univ. of Toronto, Ont., Canada

Volume :

Issue :

fYear :

2004

Firstpage :

1763

Lastpage :

1773

Abstract :

A dual-microphone speech-signal enhancement algorithm, utilizing phase-error based filters that depend only on the phase of the signals, is proposed. This algorithm involves obtaining time-varying, or alternatively, time-frequency (TF), phase-error filters based on prior knowledge regarding the time difference of arrival (TDOA) of the speech source of interest and the phases of the signals recorded by the microphones. It is shown that by masking the TF representation of the speech signals, the noise components are distorted beyond recognition while the speech source of interest maintains its perceptual quality. This is supported by digit recognition experiments which show a substantial recognition accuracy rate improvement over prior multimicrophone speech enhancement algorithms. For example, for a case with two speakers with a 0.1 s reverberation time, the phase-error based technique results in a 28.9% recognition rate gain over the single channel noisy signal, a gain of 22.0% over superdirective beamforming, and a gain of 8.5% over postfiltering.
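The abstract describes the method only in words, so the following is a minimal sketch of a phase-error time-frequency mask, not the authors' implementation. It assumes the two microphone signals are already available as complex STFT arrays, that the target TDOA is known, and that the mask takes the form 1 / (1 + gamma * theta^2), where theta is the wrapped phase error. The function name `phase_error_mask`, the parameter `gamma`, its default value, and the sign convention for the TDOA are all hypothetical choices made for illustration.

```python
import numpy as np


def phase_error_mask(X1, X2, freqs_hz, tdoa_s, gamma=5.0):
    """Return a real-valued TF mask in (0, 1] from two complex STFTs.

    X1, X2   : complex arrays of shape (n_freqs, n_frames), one per microphone
    freqs_hz : array of shape (n_freqs,), centre frequency of each STFT bin
    tdoa_s   : assumed delay of microphone 2 relative to microphone 1, in seconds
    gamma    : assumed mask sharpness (hypothetical parameter, not from the abstract)
    """
    omega = 2.0 * np.pi * np.asarray(freqs_hz)[:, None]

    # Phase error: observed inter-microphone phase difference minus the
    # difference expected for a source arriving with the given TDOA.
    theta = np.angle(X1) - np.angle(X2) - omega * tdoa_s

    # Wrap the error into [-pi, pi) so multiples of 2*pi do not count as error.
    theta = (theta + np.pi) % (2.0 * np.pi) - np.pi

    # Assumed mask shape: close to 1 when the phases agree with the target
    # TDOA, and decaying toward 0 as the phase error grows.
    return 1.0 / (1.0 + gamma * theta ** 2)
```

Under these assumptions, TF bins dominated by the target source keep a gain near 1, while bins dominated by a source with a different TDOA are attenuated. Multiplying the mask with a microphone's STFT and inverting the transform would give the enhanced signal.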

Keywords :

microphones; speech enhancement; speech recognition; time-frequency analysis; dual-microphone speech-signal enhancement; microphone array; speech processing; speech recognition; time-frequency phase-error filter; Distortion; Filters; Microphones; Reverberation; Robustness; Speech coding; Speech enhancement; Speech recognition; Time difference of arrival; Time frequency analysis; Algorithms; Computer Simulation; Information Storage and Retrieval; Models, Biological; Pattern Recognition, Automated; Signal Processing, Computer-Assisted; Sound Spectrography; Speech Intelligibility; Speech Production Measurement; Stochastic Processes; Voice Quality;

fLanguage :

English

Journal_Title :

Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on

Publisher :

IEEE

ISSN :

1083-4419

Type :

jour

DOI :

10.1109/TSMCB.2004.830345

Filename :

1315759

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1036504