Regional Information Center for Science and Technology (RICeST) - Speech Dereverberation Based on Variance-Normalized Delayed Linear Prediction

DocumentCode :

1294719

Title :

Speech Dereverberation Based on Variance-Normalized Delayed Linear Prediction

Author :

Nakatani, Tomohiro ; Yoshioka, Takuya ; Kinoshita, Keisuke ; Miyoshi, Masato ; Juang, Biing-Hwang

Author_Institution :

NTT Commun. Sci. Labs., NTT Corp., Kyoto, Japan

Volume :

Issue :

fYear :

2010

Firstpage :

1717

Lastpage :

1731

Abstract :

This paper proposes a statistical model-based speech dereverberation approach that can cancel the late reverberation of a reverberant speech signal captured by distant microphones without prior knowledge of the room impulse responses. With this approach, the generative model of the captured signal is composed of a source process, which is assumed to be a Gaussian process with a time-varying variance, and an observation process modeled by a delayed linear prediction (DLP). The optimization objective for the dereverberation problem is derived to be the sum of the squared prediction errors normalized by the source variances; hence, this approach is referred to as variance-normalized delayed linear prediction (NDLP). Inheriting the characteristic of DLP, NDLP can robustly estimate an inverse system for late reverberation in the presence of noise without greatly distorting a direct speech signal. In addition, owing to the use of variance normalization, NDLP allows us to improve the dereverberation result especially with relatively short (of the order of a few seconds) observations. Furthermore, NDLP can be implemented in a computationally efficient manner in the time-frequency domain. Experimental results demonstrate the effectiveness and efficiency of the proposed approach in comparison with two existing approaches.
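
The objective described in the abstract, a sum of squared delayed-prediction errors divided by the source variances, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a single channel and a single frequency bin of a time-frequency representation. The function names, the default `delay`, `order`, and `iters` values, the small `ridge` term, and the crude variance estimate (squared magnitude of the current prediction error, floored) are all hypothetical choices made for illustration.

```python
import numpy as np

def delayed_regressors(x, delay, order):
    """Row t holds the past frames x[t-delay], ..., x[t-delay-order+1]."""
    T = len(x)
    X = np.zeros((T, order), dtype=complex)
    for k in range(order):
        shift = delay + k
        if shift < T:
            X[shift:, k] = x[:T - shift]
    return X

def ndlp_step(x, X, var, ridge=1e-10):
    """Closed-form minimizer of sum_t |x[t] - X[t] @ g|^2 / var[t].

    The ridge term is an assumption added only for numerical stability.
    """
    Xw = X / var[:, None]                 # scale row t by 1 / var[t]
    R = Xw.conj().T @ X                   # variance-normalized correlation matrix
    p = Xw.conj().T @ x                   # variance-normalized cross-correlation
    g = np.linalg.solve(R + ridge * np.eye(X.shape[1]), p)
    return g, x - X @ g                   # prediction filter, prediction error

def ndlp(x, delay=3, order=10, iters=3, floor=1e-6):
    """Alternate between a variance estimate and the weighted solve."""
    X = delayed_regressors(x, delay, order)
    d = np.asarray(x, dtype=complex).copy()
    for _ in range(iters):
        # Assumed variance estimate: floored squared magnitude of the error.
        var = np.maximum(np.abs(d) ** 2, floor)
        g, d = ndlp_step(x, X, var)
    return d, g
```

The delay keeps the most recent frames out of the predictor, which is how delayed linear prediction avoids cancelling the direct signal. With a per-frame variance estimate as crude as the one assumed here, the alternation can be poorly behaved on a single channel, so a practical system would likely need a more careful variance estimate than this sketch uses.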

Keywords :

Gaussian processes; speech processing; Gaussian process; NDLP; delayed linear prediction; dereverberation problem; distant microphones; speech dereverberation; speech signal; time-varying variance; variance-normalized delayed linear prediction; Correlation; Microphones; Optimization; Predictive models; Reverberation; Speech; Speech processing; Dereverberation; inverse filtering; multichannel linear prediction; speech enhancement; statistical model-based signal processing;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

IEEE

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2010.2052251

Filename :

5547558

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1294719