DocumentCode :
3410812
Title :
Speech recognitionwith speech density estimation by the Dirichlet Process Mixture
Author :
Ota, Kenko ; Duflos, Emmanuel ; Vanheeghe, Philippe ; Yanagida, Masuzo
Author_Institution :
Ecole Centrale de Lille BP48, LAGIS, Villeneuve-d´´Ascq
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
1553
Lastpage :
1556
Abstract :
This paper shows a method for the modeling of speech signal distributions based on Dirichlet process mixtures (DPM) and the estimation of noise sequences based on particle filtering. In real situations, the speech recognition rate degrades miser ably because of the effect of environmental noises, reflected waves and so on. To improve the speech recognition rate, a technique for the estimation of noise sequences is necessary. In this paper, the distribution of the clean speech is modeled using the DPM instead of the traditional model, which is a Gaussian mixture model (GMM). Speech signal sequences are generated according to the mean and covariance generated from the DPM. Then, noise signal sequences are estimated with a particle filter. The proposed method using extended Kalman filter (EKF) can improve the speech recognition rate significantly in the low SNR region. Applying unscented Kalman filter (UKF), better results can be obtained in also the high SNR.
Keywords :
Gaussian processes; Kalman filters; noise (working environment); nonlinear filters; particle filtering (numerical methods); speech recognition; Dirichlet process mixture; Gaussian mixture model; environmental noise; extended Kalman filter; noise sequence estimation; particle filtering; speech density estimation; speech recognition; speech signal distribution; speech signal sequence; unscented Kalman filter; Degradation; Filtering; Particle filters; Signal generators; Signal processing; Signal to noise ratio; Speech enhancement; Speech processing; Speech recognition; Working environment noise; Kalman filtering; Signal processing; Speech enhancement; Speech recognition; Stochastic processes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4517919
Filename :
4517919
Link To Document :
بازگشت