DocumentCode :
2693209
Title :
Towards more efficient and accurate methods for Mandarin LVCSR discriminative training
Author :
Xu, Haihua ; Zhu, Jie
Author_Institution :
Dept. of Electron. Eng., Shanghai Jiao Tong Univ., Shanghai
fYear :
2008
fDate :
June 23 2008-April 26 2008
Firstpage :
981
Lastpage :
984
Abstract :
Discriminative training of Mandarin large vocabulary continuous speech recognition (LVCSR) has been remarkably improved in speech community recent years. However, much work still needs further investigating. In this work, we focus on improvements to two aspects of discriminative training method, in particular related to minimum phone error (MPE) training method in Mandarin speech recognition. One is to use syllable, not multi-character word, as speech recognition unit (SRU) to generate phone lattice to train models. The other is to investigate better objective functions related to MPE with comparisons on recent proposed methods. Experimental results showed that the proposed methods improved both efficiency and accuracy for discriminative training in Mandarin speech recognition.
Keywords :
natural language processing; speech recognition; Mandarin LVCSR discriminative training; Mandarin speech recognition; large vocabulary continuous speech recognition; minimum phone error training; phone lattice; Automatic speech recognition; Error analysis; Hidden Markov models; Lattices; Mutual information; Speech recognition; Statistical analysis; Testing; Training data; Vocabulary; Discriminative Training; Minimum Bayes Risk; Minimum Frame Error; Minimum Phone Error; SRU; Speech Recognition Unit;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2008 IEEE International Conference on
Conference_Location :
Hannover
Print_ISBN :
978-1-4244-2570-9
Electronic_ISBN :
978-1-4244-2571-6
Type :
conf
DOI :
10.1109/ICME.2008.4607601
Filename :
4607601
Link To Document :
بازگشت