مرکز منطقه ای اطلاع رساني علوم و فناوري - طراحي تخمين‌گر بيشينه درستنمايي در بهسازي گفتار مبتني بر كتاب كد با نسبت سيگنال به نويز منفي

شماره ركورد :

1284459

عنوان مقاله :

طراحي تخمين‌گر بيشينه درستنمايي در بهسازي گفتار مبتني بر كتاب كد با نسبت سيگنال به نويز منفي

عنوان به زبان ديگر :

Designing Maximum Likelihood Estimator in the Codebook Based Speech Enhancement with Negative Signal to Noise Ratio

پديد آورندگان :

دوست، رقيه پژوهشگاه ارتباطات و فناوري اطلاعات (مركز تحقيقات مخابرات ايران) - پژوهشكده فناوري اطلاعات

تعداد صفحه :

از صفحه :

از صفحه (ادامه) :

تا صفحه :

تا صفحه(ادامه) :

كليدواژه :

بهسازي گفتار , كتاب كد , نسبت سيگنال به نويز(SNR) , سنتز گفتار

چكيده فارسي :

در اين مقاله تخمين‌گر جديدي براي بهسازي گفتار با روش سنتز مبتني بر كتاب كد ارائه مي‌شود. در روش بهسازي گفتار مبتني بر كتاب كد، جداسازي نويز و گفتار از يكديگر انجام شده و با انتخاب بهينه انديس‌هاي كتاب كد گفتار، سيگنال گفتار بهسازي شده سنتز مي‌شود. از اين رو با اين روش مي‌توان گفتارهاي نويزي، با نسبت سيگنال به نويز كمتر از صفر دسيبل را بهسازي نمود. البته در اين روش انتخاب صحيح انديس‌هاي كتاب كد بسيار مهم است. از اين رو در اين مقاله تخمين‌گر بيشينه درست‌نمايي با اعمال وزن‌هاي بهبود دهنده كيفيت شنيداري، براي گفتار و نويز طراحي مي‌شود. رابطه به دست آمده براي اين تخمين‌گر به عنوان تابع فاصله در طراحي كتاب‌هاي كد نيز استفاده مي‌شود. اين روش براي گوينده-هاي مختلف و نويزهاي گوناگون شبيه‌سازي شد. نتايج نشان مي‌دهد كه گفتار بهسازي شده با استفاده از تخمين گر بيشينه درست نمايي با وزن‌هاي كيفيت شنيداري نسبت به تخمين‌گر فاصله اقليدسي، كيفيت شنيداري بهتري دارد. همچنين روش ارائه شده در برخورد با نويزهاي غيرايستان يا ايستان و نسبت سيگنال به نويز منفي(يا مثبت) موفق‌تر از روش‌هاي ديگر عمل مي‌كند. هزينه بهسازي با كيفيت برتر در اين روش، نياز به زمان نسبتاً طولاني براي بهسازي است.

چكيده لاتين :

This paper presents a new estimator for the speech enhancement using codebook. Codebook-based speech enhancement method separates the noise and speech from each other and synthesizes the enhanced speech signal by optimally selecting the speech codebook indexes. This method can enhance the noisy speech with signal to noise ratio of less than zero decibel. In this method it is very important to select the correct codebook indexes. Therefore, in this paper, the maximum likelihood estimator is proposed for speech and noise by applying auditory quality-enhancing weights. The relation of this estimator is also used as a distance function in the design of codebooks. This method is simulated for different speakers and noises. The results show the proposed maximum likelihood estimator leads to better speech enhancement than the euclidean distance estimator. The proposed method is also more successful in dealing with non-stationary or stationary noises and negative or positive SNRs than other methods. The cost of the superior quality enhancement in this method is the requirement to a relatively time-consuming signal processing.

سال انتشار :

1400

عنوان نشريه :

صنايع الكترونيك

فايل PDF :

8673924

لينک به اين مدرک :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=8&DC=1284459