مرکز منطقه ای اطلاع رساني علوم و فناوري - FFT-Based Block Processing in Speech Enhancement: Potential Artifacts and Solutions

DocumentCode :

3560963

Title :

FFT-Based Block Processing in Speech Enhancement: Potential Artifacts and Solutions

Author :

Marin-Hurtado, Jorge Ivan ; Anderson, David V.

Author_Institution :

Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA

Volume :

Issue :

fYear :

2011

Firstpage :

2527

Lastpage :

2537

Abstract :

Most speech enhancement applications perform frequency shaping by means of multiplication in the frequency domain. Operating in the frequency domain is equivalent to convolution in the time domain. In these speech enhancement algorithms, the updating of frequency response alone cannot ensure the fulfillment of the conditions required for multiplication in frequency to correspond to linear convolution instead of circular convolution. As a result, artifacts and distortions may be present in the output of a standard fast Fourier transform (FFT)-based algorithm. Typical methods to deal with these artifacts involve overlapping and windowing. However, even using these strategies, artifacts may be perceptually noticeable under certain signal-to-noise ratio (SNR) conditions and/or when a high sampling frequency is employed. This paper analyzes the efficiency of the standard methods, explains the source of these distortions, provides a perceptual evidence of these artifacts, and proposes two alternative methods to perform artifact-free and distortion-free FFT convolution. These methods are based on the extension of the impulse response and the splitting of the impulse response in two impulse responses, operations that are performed in the frequency-domain. Computational costs and performance of the proposed techniques are also discussed.

Keywords :

convolution; fast Fourier transforms; signal sampling; speech enhancement; transient response; FFT-based block processing; artifact-free FFT convolution; circular convolution; distortion-free FFT convolution; fast Fourier transform-based algorithm; frequency domain multiplication; frequency shaping; high sampling frequency response; impulse response; signal-to-noise ratio; speech enhancement algorithm; time domain multiplication; Convolution; Fast Fourier transforms; Frequency domain analysis; Speech enhancement; Block-processing artifacts; fast Fourier transform (FFT) convolution; fast convolution; speech enhancement;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

Conference_Location :

5/5/2011 12:00:00 AM

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2011.2150215

Filename :

5762593

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3560963