مرکز منطقه ای اطلاع رساني علوم و فناوري - Model Adaptation for Long Convolutional Distortion by Maximum Likelihood Based State Filtering Approach

DocumentCode :

454727

Title :

Model Adaptation for Long Convolutional Distortion by Maximum Likelihood Based State Filtering Approach

Author :

Raut, Chandra Kant ; Nishimoto, Takuya ; Sagayama, Shigeki

Author_Institution :

Graduate Sch. of Inf. Sci. & Technol., Tokyo Univ.

Volume :

fYear :

2006

fDate :

14-19 May 2006

Abstract :

In environment with considerably long reverberation time, each frame of speech is affected by energy components from the preceding frames. Therefore, to adapt parameters of a state of HMM, it becomes necessary to consider these frames, and compute their contributions to current state. However, these speech frames preceding to a state of HMM are not known during adaptation of the models. In this paper, we propose to use preceding states as units of preceding speech segments, estimate their contributions to current state in maximum likelihood manner, and adapt models by accounting their contributions. When clean models were adapted by proposed method for a speaker-dependent isolated word recognition task, word accuracy of the system typically increased from 67.6% to 83.2%, and from 44.8% to 72.5%, for channel distorted speech simulated by linear convolution of clean speech and impulse responses with reverberation time (T₆₀) of 310 ms and 780 ms, respectively

Keywords :

filtering theory; hidden Markov models; maximum likelihood estimation; speech processing; speech recognition; HMM; channel distorted speech; long convolutional distortion; maximum likelihood approach; preceding speech segments; reverberation time; speaker-dependent isolated word recognition; state filtering approach; Adaptation model; Additive noise; Automatic speech recognition; Convolution; Convolutional codes; Filtering; Hidden Markov models; Maximum likelihood estimation; Reverberation; Speech enhancement;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on

Conference_Location :

Toulouse

ISSN :

1520-6149

Print_ISBN :

1-4244-0469-X

Type :

conf

DOI :

10.1109/ICASSP.2006.1660225

Filename :

1660225

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=454727