مرکز منطقه ای اطلاع رساني علوم و فناوري - Speech enhancement using a mixture-maximum model

DocumentCode :

839052

Title :

Speech enhancement using a mixture-maximum model

Author :

Burshtein, David ; Gannot, Sharon

Author_Institution :

Dept. of Electr. Eng.-Syst., Tel Aviv Univ., Israel

Volume :

Issue :

fYear :

2002

fDate :

9/1/2002 12:00:00 AM

Firstpage :

341

Lastpage :

351

Abstract :

We present a spectral domain, speech enhancement algorithm. The new algorithm is based on a mixture model for the short time spectrum of the clean speech signal, and on a maximum assumption in the production of the noisy speech spectrum. In the past this model was used in the context of noise robust speech recognition. In this paper we show that this model is also effective for improving the quality of speech signals corrupted by additive noise. The computational requirements of the algorithm can be significantly reduced, essentially without paying performance penalties, by incorporating a dual codebook scheme with tied variances. Experiments, using recorded speech signals and actual noise sources, show that in spite of its low computational requirements, the algorithm shows improved performance compared to alternative speech enhancement algorithms.

Keywords :

Gaussian processes; computational complexity; noise; spectral analysis; speech coding; speech enhancement; speech intelligibility; speech recognition; Gaussian mixture model; MIXMAX model; additive noise; clean speech signal; dual codebook; low computational requirements; mixture model; mixture-maximum model; noise robust speech recognition; noise sources; noisy speech spectrum; performance penalties; recorded speech signals; short time spectrum; spectral domain; speech enhancement algorithm; speech intelligibility; speech signal quality; tied variances; Additive noise; Amplitude estimation; Background noise; Context modeling; Filtering algorithms; Hidden Markov models; Noise robustness; Speech coding; Speech enhancement; Speech recognition;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/TSA.2002.803420

Filename :

1040258

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=839052