DocumentCode :
1294719
Title :
Speech Dereverberation Based on Variance-Normalized Delayed Linear Prediction
Author :
Nakatani, Tomohiro ; Yoshioka, Takuya ; Kinoshita, Keisuke ; Miyoshi, Masato ; Juang, Biing-Hwang
Author_Institution :
NTT Commun. Sci. Labs., NTT Corp., Kyoto, Japan
Volume :
18
Issue :
7
fYear :
2010
Firstpage :
1717
Lastpage :
1731
Abstract :
This paper proposes a statistical model-based speech dereverberation approach that can cancel the late reverberation of a reverberant speech signal captured by distant microphones without prior knowledge of the room impulse responses. With this approach, the generative model of the captured signal is composed of a source process, which is assumed to be a Gaussian process with a time-varying variance, and an observation process modeled by a delayed linear prediction (DLP). The optimization objective for the dereverberation problem is derived to be the sum of the squared prediction errors normalized by the source variances; hence, this approach is referred to as variance-normalized delayed linear prediction (NDLP). Inheriting the characteristic of DLP, NDLP can robustly estimate an inverse system for late reverberation in the presence of noise without greatly distorting a direct speech signal. In addition, owing to the use of variance normalization, NDLP allows us to improve the dereverberation result especially with relatively short (of the order of a few seconds) observations. Furthermore, NDLP can be implemented in a computationally efficient manner in the time-frequency domain. Experimental results demonstrate the effectiveness and efficiency of the proposed approach in comparison with two existing approaches.
Keywords :
Gaussian processes; speech processing; Gaussian process; NDLP; delayed linear prediction; dereverberation problem; distant microphones; speech dereverberation; speech signal; time-varying variance; variance-normalized delayed linear prediction; Correlation; Microphones; Optimization; Predictive models; Reverberation; Speech; Speech processing; Dereverberation; inverse filtering; multichannel linear prediction; speech enhancement; statistical model-based signal processing;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2010.2052251
Filename :
5547558
Link To Document :
بازگشت