مرکز منطقه ای اطلاع رساني علوم و فناوري - Speech Enhancement, Gain, and Noise Spectrum Adaptation Using Approximate Bayesian Estimation

DocumentCode :

1059879

Title :

Speech Enhancement, Gain, and Noise Spectrum Adaptation Using Approximate Bayesian Estimation

Author :

Jiucang Hao ; Attias, H. ; Nagarajan, Sasi ; Te-Won Lee ; Sejnowski, Terrence J.

Author_Institution :

Inst. for Neural Comput., Univ. of California, San Diego, CA

Volume :

Issue :

fYear :

2009

Firstpage :

Lastpage :

Abstract :

This paper presents a new approximate Bayesian estimator for enhancing a noisy speech signal. The speech model is assumed to be a Gaussian mixture model (GMM) in the log-spectral domain. This is in contrast to most current models in frequency domain. Exact signal estimation is a computationally intractable problem. We derive three approximations to enhance the efficiency of signal estimation. The Gaussian approximation transforms the log-spectral domain GMM into the frequency domain using minimal Kullback-Leiber (KL)-divergency criterion. The frequency domain Laplace method computes the maximum a posteriori (MAP) estimator for the spectral amplitude. Correspondingly, the log-spectral domain Laplace method computes the MAP estimator for the log-spectral amplitude. Further, the gain and noise spectrum adaptation are implemented using the expectation-maximization (EM) algorithm within the GMM under Gaussian approximation. The proposed algorithms are evaluated by applying them to enhance the speeches corrupted by the speech-shaped noise (SSN). The experimental results demonstrate that the proposed algorithms offer improved signal-to-noise ratio, lower word recognition error rate, and less spectral distortion.

Keywords :

Bayes methods; Gaussian processes; Laplace transforms; approximation theory; expectation-maximisation algorithm; frequency-domain analysis; maximum likelihood estimation; spectral-domain analysis; speech enhancement; Gaussian approximation transforms; Gaussian mixture model; approximate Bayesian estimation; expectation-maximization algorithm; frequency domain Laplace method; log-spectral domain method; maximum a posteriori estimator; minimal Kullback-Leiber divergency criterion; noise spectrum adaptation; noisy speech signal enhancement; signal estimation; Amplitude estimation; Approximation algorithms; Bayesian methods; Frequency domain analysis; Frequency estimation; Gaussian approximation; Gaussian noise; Signal to noise ratio; Speech analysis; Speech enhancement; Approximate Bayesian estimation; Gaussian mixture model (GMM); speech enhancement;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2008.2005342

Filename :

4740160

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1059879