مرکز منطقه ای اطلاع رساني علوم و فناوري - Correlation-Based and Model-Based Blind Single-Channel Late-Reverberation Suppression in Noisy Time-Varying Acoustical Environments

DocumentCode :

1501636

Title :

Correlation-Based and Model-Based Blind Single-Channel Late-Reverberation Suppression in Noisy Time-Varying Acoustical Environments

Author :

Erkelens, Jan S. ; Heusdens, Richard

Author_Institution :

Dept. of Mediamatics, Delft Univ. of Technol., Delft, Netherlands

Volume :

Issue :

fYear :

2010

Firstpage :

1746

Lastpage :

1765

Abstract :

This paper considers suppression of late reverberation and additive noise in single-channel speech recordings. The reverberation introduces long-term correlation in the observed signal. In the first part of this work, we show how this correlation can be used to estimate the late reverberant spectral variance (LRSV) without having to assume a specific model for the room impulse responses (RIRs) while no explicit estimates of RIR model parameters are needed. That makes this correlation-based approach more robust against RIR modeling errors. However, the correlation-based method can follow only slow time variations in the RIRs. Existing model-based methods use statistical models for the RIRs, that depend on one or more parameters that have to be estimated blindly. The common statistical models lead to simple expressions for the LRSV that depend on past values of the spectral variance of the reverberant, noise-free, signal. All existing model-based LRSV estimators in the literature are derived assuming the RIRs to be time-invariant realizations of a stochastic process. In the second part of this paper, we go one step further and analyze time-varying RIRs. We show that in this case the reverberance tends to become decorrelated. We discuss the relations between different RIR models and their corresponding LRSV estimators. We show theoretically that similar simple estimators exist as in the time-invariant case, provided that the reverberation time T₆₀ and direct-to-reverberation ratio (DRR) of the RIRs remain nearly constant during an interval of the order of a few frames. We show that the reverberation time can be taken frequency-bin independent in DFT-based enhancement algorithms. Experiments with time-varying RIRs validate the analysis. Experiments with additive nonstationary noise and time-invariant RIRs show the influence of blind estimation of the reverberation time and the DRR.

Keywords :

discrete Fourier transforms; reverberation; speech enhancement; statistical analysis; stochastic processes; time-varying channels; DFT-based enhancement algorithms; DRR; correlation-based blind single-channel late-reverberation suppression; direct-to-reverberation ratio; discrete Fourier transform; late reverberant spectral variance; model-based LRSV estimators; model-based blind single-channel late-reverberation suppression; noisy time-varying acoustical environments; room impulse response; single-channel speech recordings; speech enhancement; statistical models; stochastic process; time-varying RIR model parameters; Additive noise; Decorrelation; Frequency; Permission; Speech enhancement; Stochastic processes; Working environment noise; Discrete Fourier transform (DFT)-based speech enhancement;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2010.2051271

Filename :

5471178

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1501636