مرکز منطقه ای اطلاع رساني علوم و فناوري - Optimized Speech Dereverberation From Probabilistic Perspective for Time Varying Acoustic Transfer Function

DocumentCode :

78999

Title :

Optimized Speech Dereverberation From Probabilistic Perspective for Time Varying Acoustic Transfer Function

Author :

Togami, Masahito ; Kawaguchi, Yuki ; Takeda, Ryu ; Obuchi, Yasunari ; Nukaga, N.

Author_Institution :

Central Res. Lab., Hitachi Ltd., Kokubunji, Japan

Volume :

Issue :

fYear :

2013

fDate :

Jul-13

Firstpage :

1369

Lastpage :

1380

Abstract :

A dereverberation technique has been developed that optimally combines multichannel inverse filtering (MIF), beamforming (BF), and non-linear reverberation suppression (NRS). It is robust against acoustic transfer function (ATF) fluctuations and creates less distortion than the NRS alone. The three components are optimally combined from a probabilistic perspective using a unified likelihood function incorporating two probabilistic models. A multichannel probabilistic source model based on a recently proposed local Gaussian model (LGM) provides robustness against ATF fluctuations of the early reflection. A probabilistic reverberant transfer function model (PRTFM) provides robustness against ATF fluctuations of the late reverberation. The MIF and multichannel under-determined source separation (MUSS) are optimized in an iterative manner. The MIF is designed to reduce the time-invariant part of the late reverberation by using optimal time-weighting with reference to the PRTFM and the LGM. The MUSS separates the dereverberated speech signal and the residual reverberation after the MIF, which can be interpreted as an optimized combination of the BF and the NRS. The parameters of the PRTFM and the LGM are optimized based on the MUSS output. Experimental results show that the proposed method is robust against the ATF fluctuations under both single and multiple source conditions.

Keywords :

Gaussian channels; array signal processing; filtering theory; iterative methods; probability; reverberation; source separation; transfer functions; ATF fluctuations; beamforming; dereverberated speech signal; iterative manner; local Gaussian model; multichannel inverse filtering; multichannel probabilistic source model; multichannel under-determined source separation; multiple source conditions; nonlinear reverberation suppression; optimal time-weighting; optimized speech dereverberation; probabilistic perspective; probabilistic reverberant transfer function model; residual reverberation; single source conditions; time varying acoustic transfer function; time-invariant part; unified likelihood function; Microphones; Nonlinear distortion; Probabilistic logic; Reverberation; Robustness; Speech; Transfer functions; Dereverberation; expectation-maximization algorithm; local Gaussian modeling; multichannel filtering; time-varying acoustic transfer function;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2013.2250960

Filename :

6473840

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=78999