DocumentCode
590612
Title
Optimizing the parameters of decoding graphs using new log-based MCE
Author
Abdelhamid, Abdelaziz A. ; Abdulla, Waleed H.
Author_Institution
Electr. & Comput. Eng., Univ. of Auckland, Auckland, New Zealand
fYear
2012
fDate
3-6 Dec. 2012
Firstpage
1
Lastpage
5
Abstract
This paper proposes a new class loss function as an alternative to the standard sigmoid class loss function for optimizing the parameters of decoding graphs using discriminative training based on the minimum classification error (MCE) criterion. The standard sigmoid-based approach tends to ignore a significant number of training samples that have a large difference between the scores of the reference and their corresponding competing hypotheses, and this affects the parameter optimization. The proposed function overcomes this limitation by considering almost all the training samples and thus improves the parameter optimization when tested on large decoding graphs. The decoding graph used in this research is an integrated network of weighted finite-state transducers. The primary task examined is a 64K-word continuous speech recognition task. The experimental results show that the proposed method outperforms the baseline systems based on maximum likelihood estimation (MLE) and sigmoid-based MCE, achieving a reduction in word error rate (WER) of 28.9% when tested on the TIMIT speech database.
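The limitation described in the abstract can be illustrated with a minimal sketch. It assumes the usual MCE convention in which d is the misclassification measure (competing score minus reference score, so d > 0 means an error) and the sigmoid class loss is 1 / (1 + exp(-gamma * d + theta)). The gradient of that loss vanishes when |d| is large, so such samples contribute almost nothing to the update. The abstract does not give the form of the proposed log-based function; the softplus loss log(1 + exp(gamma * d)) used below is only a hypothetical stand-in chosen to show a log-shaped loss whose gradient does not vanish for large positive d. All function names and the threshold eps are illustrative assumptions, not taken from the paper.

```python
import math


def _sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def sigmoid_loss(d, gamma=1.0, theta=0.0):
    """Standard MCE sigmoid class loss: 1 / (1 + exp(-gamma * d + theta))."""
    return _sigmoid(gamma * d - theta)


def sigmoid_loss_grad(d, gamma=1.0, theta=0.0):
    """Derivative of the sigmoid loss with respect to d."""
    s = sigmoid_loss(d, gamma, theta)
    return gamma * s * (1.0 - s)


def softplus_loss_grad(d, gamma=1.0):
    """Derivative of log(1 + exp(gamma * d)) with respect to d.

    Hypothetical stand-in for a log-based loss; NOT the function
    proposed in the paper, whose form the abstract does not state.
    """
    return gamma * _sigmoid(gamma * d)


def active_fraction(grads, eps=1e-3):
    """Fraction of samples whose gradient magnitude exceeds eps (assumed threshold)."""
    return sum(1 for g in grads if abs(g) > eps) / len(grads)


if __name__ == "__main__":
    # Illustrative misclassification measures, not data from the paper.
    d_values = [-20.0, -5.0, -1.0, 0.0, 1.0, 5.0, 20.0]
    sig = [sigmoid_loss_grad(d) for d in d_values]
    sp = [softplus_loss_grad(d) for d in d_values]
    print("sigmoid active fraction: ", active_fraction(sig))
    print("softplus active fraction:", active_fraction(sp))
```

Under these assumptions, the sample at d = 20 (a badly misrecognized utterance) has a near-zero gradient under the sigmoid loss but a gradient close to gamma under the stand-in loss, which is the kind of behavior the abstract attributes to the proposed function.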
Keywords
graph theory; maximum likelihood estimation; optimisation; pattern classification; speech coding; MLE; TIMIT speech database; WER; decoding graphs; discriminative training; log-based MCE criterion; maximum likelihood estimation; minimum classification error; parameters optimization; sigmoid class loss function; word error rate; Acoustics; Decoding; Hidden Markov models; Optimization; Speech; Speech processing; Training;
fLanguage
English
Publisher
IEEE
Conference_Titel
Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific
Conference_Location
Hollywood, CA
Print_ISBN
978-1-4673-4863-8
Type
conf
Filename
6411759
Link To Document