Title :
Model Adaptation for Long Convolutional Distortion by Maximum Likelihood Based State Filtering Approach
Author :
Raut, Chandra Kant ; Nishimoto, Takuya ; Sagayama, Shigeki
Author_Institution :
Graduate Sch. of Inf. Sci. & Technol., Tokyo Univ.
Abstract :
In environment with considerably long reverberation time, each frame of speech is affected by energy components from the preceding frames. Therefore, to adapt parameters of a state of HMM, it becomes necessary to consider these frames, and compute their contributions to current state. However, these speech frames preceding to a state of HMM are not known during adaptation of the models. In this paper, we propose to use preceding states as units of preceding speech segments, estimate their contributions to current state in maximum likelihood manner, and adapt models by accounting their contributions. When clean models were adapted by proposed method for a speaker-dependent isolated word recognition task, word accuracy of the system typically increased from 67.6% to 83.2%, and from 44.8% to 72.5%, for channel distorted speech simulated by linear convolution of clean speech and impulse responses with reverberation time (T60) of 310 ms and 780 ms, respectively
Keywords :
filtering theory; hidden Markov models; maximum likelihood estimation; speech processing; speech recognition; HMM; channel distorted speech; long convolutional distortion; maximum likelihood approach; preceding speech segments; reverberation time; speaker-dependent isolated word recognition; state filtering approach; Adaptation model; Additive noise; Automatic speech recognition; Convolution; Convolutional codes; Filtering; Hidden Markov models; Maximum likelihood estimation; Reverberation; Speech enhancement;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
Print_ISBN :
1-4244-0469-X
DOI :
10.1109/ICASSP.2006.1660225