مرکز منطقه ای اطلاع رساني علوم و فناوري - Kalman-filtering speech enhancement method based on a voiced-unvoiced speech model

DocumentCode :

1544779

Title :

Kalman-filtering speech enhancement method based on a voiced-unvoiced speech model

Author :

Goh, Zenton ; Kah-Chye Tan ; Tan, B.T.G.

Author_Institution :

Sch. of Electr. & Electron. Eng., Nanyang Technol. Inst., Singapore

Volume :

Issue :

fYear :

1999

fDate :

9/1/1999 12:00:00 AM

Firstpage :

510

Lastpage :

524

Abstract :

In this work, we are concerned with optimal estimation of clean speech from its noisy version based on a speech model we propose. We first propose a (single) speech model which satisfactorily describes voiced and unvoiced speech and silence (i.e., pauses between speech utterances), and also allows for exploitation of the long term characteristics of noise. We then reformulate the model equations so as to facilitate subsequent application of the well-established Kalman filter for computing the optimal estimate of the clean speech in the minimum-mean-square-error sense. Since the standard algorithm for Kalman-filtering involves multiplications of very large matrices and thus demands high computational cost, we devise a mathematically equivalent algorithm which is computationally much more efficient, by exploiting the sparsity of the matrices concerned. Next, we present the methods we use for estimating the model parameters and give a complete description of the enhancement process. Performance assessment based on spectrogram plots, objective measures and informal subjective listening tests all indicate that our method gives consistently good results. As far as signal-to-noise ratio is concerned, the improvements over existing methods can be as high as 4 dB

Keywords :

Gaussian noise; Kalman filters; filtering theory; noise; optimisation; parameter estimation; speech enhancement; white noise; Kalman-filtering speech enhancement method; SNR; clean speech; high computational cost; informal subjective listening tests; long term characteristics; mathematically equivalent algorithm; matrix multiplication; minimum-mean-square-error; model equations; model parameters estimation; objective measures; optimal estimation; pauses; performance assessment; signal-to-noise ratio; silence; sparse matrices; spectrogram plots; speech model; speech utterances; unvoiced speech; voiced speech; voiced-unvoiced speech model; white Gaussian noise; Computational efficiency; Equations; Filtering algorithms; Kalman filters; Parameter estimation; Signal processing; Spectrogram; Speech coding; Speech enhancement; Speech processing;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/89.784103

Filename :

784103

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1544779