DocumentCode :
1501636
Title :
Correlation-Based and Model-Based Blind Single-Channel Late-Reverberation Suppression in Noisy Time-Varying Acoustical Environments
Author :
Erkelens, Jan S. ; Heusdens, Richard
Author_Institution :
Dept. of Mediamatics, Delft Univ. of Technol., Delft, Netherlands
Volume :
18
Issue :
7
fYear :
2010
Firstpage :
1746
Lastpage :
1765
Abstract :
This paper considers suppression of late reverberation and additive noise in single-channel speech recordings. The reverberation introduces long-term correlation in the observed signal. In the first part of this work, we show how this correlation can be used to estimate the late reverberant spectral variance (LRSV) without having to assume a specific model for the room impulse responses (RIRs) while no explicit estimates of RIR model parameters are needed. That makes this correlation-based approach more robust against RIR modeling errors. However, the correlation-based method can follow only slow time variations in the RIRs. Existing model-based methods use statistical models for the RIRs, that depend on one or more parameters that have to be estimated blindly. The common statistical models lead to simple expressions for the LRSV that depend on past values of the spectral variance of the reverberant, noise-free, signal. All existing model-based LRSV estimators in the literature are derived assuming the RIRs to be time-invariant realizations of a stochastic process. In the second part of this paper, we go one step further and analyze time-varying RIRs. We show that in this case the reverberance tends to become decorrelated. We discuss the relations between different RIR models and their corresponding LRSV estimators. We show theoretically that similar simple estimators exist as in the time-invariant case, provided that the reverberation time T60 and direct-to-reverberation ratio (DRR) of the RIRs remain nearly constant during an interval of the order of a few frames. We show that the reverberation time can be taken frequency-bin independent in DFT-based enhancement algorithms. Experiments with time-varying RIRs validate the analysis. Experiments with additive nonstationary noise and time-invariant RIRs show the influence of blind estimation of the reverberation time and the DRR.
Keywords :
discrete Fourier transforms; reverberation; speech enhancement; statistical analysis; stochastic processes; time-varying channels; DFT-based enhancement algorithms; DRR; correlation-based blind single-channel late-reverberation suppression; direct-to-reverberation ratio; discrete Fourier transform; late reverberant spectral variance; model-based LRSV estimators; model-based blind single-channel late-reverberation suppression; noisy time-varying acoustical environments; room impulse response; single-channel speech recordings; speech enhancement; statistical models; stochastic process; time-varying RIR model parameters; Additive noise; Decorrelation; Frequency; Permission; Speech enhancement; Stochastic processes; Working environment noise; Discrete Fourier transform (DFT)-based speech enhancement;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2010.2051271
Filename :
5471178
Link To Document :
بازگشت