DocumentCode :
3560963
Title :
FFT-Based Block Processing in Speech Enhancement: Potential Artifacts and Solutions
Author :
Marin-Hurtado, Jorge Ivan ; Anderson, David V.
Author_Institution :
Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
Volume :
19
Issue :
8
fYear :
2011
Firstpage :
2527
Lastpage :
2537
Abstract :
Most speech enhancement applications perform frequency shaping by means of multiplication in the frequency domain, which is equivalent to convolution in the time domain. In these speech enhancement algorithms, updating the frequency response alone cannot ensure that the conditions required for multiplication in frequency to correspond to linear convolution, rather than circular convolution, are fulfilled. As a result, artifacts and distortions may be present in the output of a standard fast Fourier transform (FFT)-based algorithm. Typical methods to deal with these artifacts involve overlapping and windowing. However, even with these strategies, artifacts may be perceptually noticeable under certain signal-to-noise ratio (SNR) conditions and/or when a high sampling frequency is employed. This paper analyzes the efficiency of the standard methods, explains the source of these distortions, provides perceptual evidence of these artifacts, and proposes two alternative methods to perform artifact-free and distortion-free FFT convolution. These methods are based on extending the impulse response and on splitting the impulse response into two impulse responses, both operations being performed in the frequency domain. Computational costs and performance of the proposed techniques are also discussed.
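The circular-versus-linear convolution issue described in the abstract can be illustrated with a minimal sketch. It is not the paper's proposed method: it only shows the standard condition that the transform length must be at least the block length plus the impulse-response length minus one. The function names (`dft`, `freq_domain_filter`, `linear_convolution`) and the sample values are hypothetical, and a naive DFT stands in for an FFT so the example needs only the standard library.

```python
import cmath

def dft(x, inverse=False):
    """Naive O(N^2) discrete Fourier transform (stand-in for an FFT)."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = sum(x[t] * cmath.exp(sign * 2j * cmath.pi * k * t / n)
                  for t in range(n))
        out.append(acc / n if inverse else acc)
    return out

def freq_domain_filter(block, h, nfft):
    """Zero-pad block and impulse response to nfft, multiply spectra, invert."""
    xp = list(block) + [0.0] * (nfft - len(block))
    hp = list(h) + [0.0] * (nfft - len(h))
    spectrum = [a * b for a, b in zip(dft(xp), dft(hp))]
    return [round(v.real, 9) for v in dft(spectrum, inverse=True)]

def linear_convolution(x, h):
    """Direct time-domain linear convolution, used as the reference."""
    y = [0.0] * (len(x) + len(h) - 1)
    for i, xi in enumerate(x):
        for j, hj in enumerate(h):
            y[i + j] += xi * hj
    return y

block = [1.0, 2.0, 3.0, 4.0]   # one signal block (hypothetical values)
h = [1.0, 1.0, 1.0]            # short impulse response (hypothetical values)

reference = linear_convolution(block, h)            # 4 + 3 - 1 = 6 samples

# Transform length equal to the block length: the 2-sample tail wraps
# around and corrupts the first two output samples (circular convolution).
aliased = freq_domain_filter(block, h, nfft=4)

# Transform length >= 6: the product corresponds to linear convolution.
clean = freq_domain_filter(block, h, nfft=8)[:len(reference)]

print("linear  :", reference)
print("nfft = 4:", aliased)
print("nfft = 8:", clean)
```

With these values, the first two samples of the `nfft=4` result differ from the reference because the convolution tail folds back onto the start of the block. The padded `nfft=8` result matches the reference sample for sample.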
Keywords :
convolution; fast Fourier transforms; signal sampling; speech enhancement; transient response; FFT-based block processing; artifact-free FFT convolution; circular convolution; distortion-free FFT convolution; fast Fourier transform-based algorithm; frequency domain multiplication; frequency shaping; high sampling frequency response; impulse response; signal-to-noise ratio; speech enhancement algorithm; time domain multiplication; Convolution; Fast Fourier transforms; Frequency domain analysis; Speech enhancement; Block-processing artifacts; fast Fourier transform (FFT) convolution; fast convolution; speech enhancement;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
IEEE
Conference_Location :
5/5/2011 12:00:00 AM
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2011.2150215
Filename :
5762593
Link To Document :