مرکز منطقه ای اطلاع رساني علوم و فناوري - Speech enhancement based on hidden Markov model using sparse code shrinkage

Other language title :

بهسازي گفتار بر پايه مدل مخفي ماركوف با استفاده از گرفتگي كدهاي تنك

Title of article :

Speech enhancement based on hidden Markov model using sparse code shrinkage

Author/Authors :

Golrasan, E Department of Computer Engineering - Sharif University of Technology, Tehran , Sameti, H Department of Computer Engineering - Sharif University of Technology, Tehran

Issue Information :

دوفصلنامه با شماره پیاپی سال 2016

Pages :

From page :

213

To page :

218

Abstract :

This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM framework, namely sparse code shrinkage-HMM (SCS-HMM). The proposed method on TIMIT database in the presence of three noise types at three SNR levels in terms of PESQ and SNR are evaluated and compared with Auto-Regressive HMM (AR-HMM) and speech enhancement based on HMM with discrete cosine transform (DCT) coefficients using Laplace and Gaussian distributions (LaGa-HMMDCT). The results confirm the superiority of SCS-HMM method in presence of non-stationary noises compared to LaGa-HMMDCT. The results of SCS-HMM method represent better performance of this method compared to AR-HMM in presence of white noise based on PESQ measure.

Farsi abstract :

اين مقاله يك چارچوب جديد براي بهسازي گفتار با مدل مخفي ماركوف و مبتني بر تحليل مؤلفه مستقل (ICA) ارائه مي كند. مراحل تحليل براي آموزش مدل هاي گفتار و نويز با استفاده از روش بازتخمين باوم ارائه مي شود و يك تخمين گر MAP مبتني بر تركيب توزيع لاپلاس-گوسي (به ترتيب براي سيگنال هاي گفتار تميز و نويز) در ساختار مبتني بر مدل مخفي ماركوف تحت عنوان SCS-HMM - پيشنهاد مي شود. روش پيشنهادي روي دادگان TIMIT با سه نوع نويز در سه مقدار SNR ورودي با معيارهاي PESQ و SNR خروجي مورد ارزيابي قرار مي گيرد و با دو روش AR-HMM و نيز روش بهسازي بر پايه ضرايب تبديل كسينوسي گسسته با استفاده از توزيع هاي لاپلاس و گوسي (LaGa-HMMdct) مقايسه مي شود. نتايج به دست آمده نشان دهنده برتري SCS-HMMدر حضور نويزهاي ناايستان نسبت به LaGa-HMMdct است . همچنين نتايج SCS-HMM عملكرد بهتر اين روش نسبت به AR - HMM را در حضور نويز سفيد با معيار PESQ نشان مي دهد.

Keywords :

Speech Signal Enhancement , HMM-based Speech Enhancement , Multivariate Laplace Distribution , Independent Component Analysis (ICA transform) , Sparse Code Shrinkage Enhancement Method

Journal title :

Astroparticle Physics

Serial Year :

2016

Record number :

2406365

Link To Document :

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=2406365