DocumentCode :
661362
Title :
A particle filter compensation approach to robust LVCSR
Author :
Duc Hoang Ha Nguyen ; Mushtaq, Aleem ; Xiong Xiao ; Eng Siong Chng ; Haizhou Li ; Chin-Hui Lee
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
fYear :
2013
fDate :
Oct. 29 2013-Nov. 1 2013
Firstpage :
1
Lastpage :
7
Abstract :
We extend our previous work on particle filter compensation (PFC) to large vocabulary continuous speech recognition (LVCSR) and conduct the experiments on Aurora-4 database. Obtaining an accurately aligned state and mixture sequence of hidden Markov models (HMMs) that describe the underlying clean speech features being estimated in noise is a challenging task for sub-word based LVCSR because the total number of triphone models involved can be very large. In this paper, we show that by using separate sets of HMMs for recognition and compensation, we can simplify the models used for PFC to a great extent and thus facilitate the estimation of the side information offered in the state and mixture sequences. When the missing side information for PFC is available, a large word error reduction of 28.46% from multi-condition training is observed. In the actual scenarios, an error reduction of only 5.3% is obtained. We are anticipating improved results that will narrow the gap between the system today and what´s achievable if the side information could be exactly specified.
Keywords :
compensation; hidden Markov models; particle filtering (numerical methods); speech recognition; Aurora-4 database; HMM; PFC; hidden Markov models; large vocabulary continuous speech recognition; mixture sequences; multicondition training; particle filter compensation approach; side information estimation; speech feature compensation; state sequences; sub-word based LVCSR; triphone models; word error reduction; Computational modeling; Hidden Markov models; Mel frequency cepstral coefficient; Noise; Noise measurement; Speech; clustering; particle filter; robustness; speech feature compensation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013 Asia-Pacific
Conference_Location :
Kaohsiung
Type :
conf
DOI :
10.1109/APSIPA.2013.6694223
Filename :
6694223
Link To Document :
بازگشت