DocumentCode :
78999
Title :
Optimized Speech Dereverberation From Probabilistic Perspective for Time Varying Acoustic Transfer Function
Author :
Togami, Masahito ; Kawaguchi, Yuki ; Takeda, Ryu ; Obuchi, Yasunari ; Nukaga, N.
Author_Institution :
Central Res. Lab., Hitachi Ltd., Kokubunji, Japan
Volume :
21
Issue :
7
fYear :
2013
fDate :
July 2013
Firstpage :
1369
Lastpage :
1380
Abstract :
A dereverberation technique has been developed that optimally combines multichannel inverse filtering (MIF), beamforming (BF), and non-linear reverberation suppression (NRS). It is robust against acoustic transfer function (ATF) fluctuations and creates less distortion than the NRS alone. The three components are optimally combined from a probabilistic perspective using a unified likelihood function incorporating two probabilistic models. A multichannel probabilistic source model based on a recently proposed local Gaussian model (LGM) provides robustness against ATF fluctuations of the early reflection. A probabilistic reverberant transfer function model (PRTFM) provides robustness against ATF fluctuations of the late reverberation. The MIF and multichannel under-determined source separation (MUSS) are optimized in an iterative manner. The MIF is designed to reduce the time-invariant part of the late reverberation by using optimal time-weighting with reference to the PRTFM and the LGM. The MUSS separates the dereverberated speech signal and the residual reverberation after the MIF, which can be interpreted as an optimized combination of the BF and the NRS. The parameters of the PRTFM and the LGM are optimized based on the MUSS output. Experimental results show that the proposed method is robust against the ATF fluctuations under both single and multiple source conditions.
Keywords :
Gaussian channels; array signal processing; filtering theory; iterative methods; probability; reverberation; source separation; transfer functions; ATF fluctuations; beamforming; dereverberated speech signal; iterative manner; local Gaussian model; multichannel inverse filtering; multichannel probabilistic source model; multichannel under-determined source separation; multiple source conditions; nonlinear reverberation suppression; optimal time-weighting; optimized speech dereverberation; probabilistic perspective; probabilistic reverberant transfer function model; residual reverberation; single source conditions; time varying acoustic transfer function; time-invariant part; unified likelihood function; Microphones; Nonlinear distortion; Probabilistic logic; Reverberation; Robustness; Speech; Transfer functions; Dereverberation; expectation-maximization algorithm; local Gaussian modeling; multichannel filtering; time-varying acoustic transfer function;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2013.2250960
Filename :
6473840