Regional Information Center for Science and Technology - Adaptive Time Segmentation for Improved Speech Enhancement

DocumentCode :

788354

Title :

Adaptive Time Segmentation for Improved Speech Enhancement

Author :

Hendriks, Richard C. ; Heusdens, Richard ; Jensen, Jesper

Author_Institution :

Dept. of Mediamatics, Delft Univ. of Technol.

Volume :

Issue :

Year :

2006

Firstpage :

2064

Lastpage :

2074

Abstract :

Single-channel enhancement algorithms are widely used to overcome the degradation of noisy speech signals. Speech enhancement gain functions are typically computed from two quantities: an estimate of the noise power spectrum and an estimate of the noisy speech power spectrum. The variance of these power spectral estimates degrades the quality of the enhanced signal, and smoothing techniques are therefore often used to decrease the variance. In this paper, we present a method to determine the noisy speech power spectrum based on an adaptive time segmentation. More specifically, the proposed algorithm determines for each noisy frame which of the surrounding frames should contribute to the corresponding noisy power spectral estimate. Further, we demonstrate the potential of our adaptive segmentation in both maximum likelihood and decision-directed speech enhancement methods by making a better estimate of the a priori signal-to-noise ratio (SNR) ξ. Objective and subjective experiments show that an adaptive time segmentation leads to significant performance improvements in comparison to the conventionally used fixed segmentations, particularly in transitional regions, where we observe local SNR improvements on the order of 5 dB.
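
As a rough illustration of how the two quantities named in the abstract (a noise power spectrum estimate and a noisy speech power spectrum estimate) can feed an a priori SNR estimate and a gain function, the following Python sketch uses the textbook maximum likelihood and decision-directed forms with a Wiener gain. It is not the authors' algorithm: the function names, the `start`/`stop` segment bounds, the smoothing factor `alpha`, and the toy numbers are all assumptions made for this example, and the segment is simply passed in rather than chosen by the paper's adaptive procedure.

```python
import numpy as np


def ml_a_priori_snr(noisy_power, noise_psd, start, stop):
    """ML-style a priori SNR for one frame (hypothetical helper).

    noisy_power: array of shape (frames, bins) holding |Y|^2 per frame.
    noise_psd:   array of shape (bins,) holding the noise power estimate.
    start, stop: frames start..stop-1 are averaged; an adaptive scheme
                 would pick these per frame, here they are given.
    """
    avg_power = noisy_power[start:stop].mean(axis=0)
    # Subtract the noise contribution and clip negative values to zero.
    return np.maximum(avg_power / noise_psd - 1.0, 0.0)


def decision_directed_snr(prev_clean_power, noisy_power_est, noise_psd, alpha=0.98):
    """Textbook decision-directed a priori SNR (alpha is an assumed default)."""
    ml_term = np.maximum(noisy_power_est / noise_psd - 1.0, 0.0)
    return alpha * prev_clean_power / noise_psd + (1.0 - alpha) * ml_term


def wiener_gain(xi):
    """Wiener gain computed from the a priori SNR."""
    return xi / (1.0 + xi)


# Toy data: 3 frames, 2 frequency bins (made-up numbers).
noisy_power = np.array([[4.0, 1.0], [6.0, 1.0], [2.0, 3.0]])
noise_psd = np.array([1.0, 2.0])

xi_ml = ml_a_priori_snr(noisy_power, noise_psd, start=0, stop=2)
gain = wiener_gain(xi_ml)
xi_dd = decision_directed_snr(
    prev_clean_power=np.zeros(2),
    noisy_power_est=noisy_power[0:2].mean(axis=0),
    noise_psd=noise_psd,
    alpha=0.5,
)
```

Widening or narrowing the `start`/`stop` range changes how many frames are averaged, which is the trade-off between variance reduction and smearing across transitions that the abstract refers to.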

Keywords :

maximum likelihood estimation; smoothing methods; speech enhancement; SNR; adaptive time segmentation; decision direction-based speech enhancement methods; maximum likelihood methods; noise power spectrum; noisy speech power spectrum; noisy speech signals; signal-to-noise ratio; single-channel enhancement algorithms; smoothing techniques; Additive noise; Degradation; Discrete Fourier transforms; Frequency estimation; Materials science and technology; Signal processing; Signal to noise ratio; Smoothing methods; Speech enhancement; Speech processing; Adaptive time segmentation; a priori signal-to-noise ratio (SNR); decision directed approach; hypothesis test; speech enhancement;

Language :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

IEEE

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2006.872596

Filename :

1709895

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=788354