DocumentCode :
2016725
Title :
Mandarin-English bilingual phone modeling and combining MPE based Discriminative training for cross-language speech recognition
Author :
Qian, Yanmin ; Liu, Jia
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
fYear :
2010
fDate :
Nov. 29 2010-Dec. 3 2010
Firstpage :
103
Lastpage :
108
Abstract :
Automatic multilingual speech recognition remains a difficult task. This paper presents recent work on the development of a Mandarin-English bilingual speech recognition system. First, a universal set of bilingual acoustic models based on a novel State-Time-Alignment (STA) method is proposed to balance the performance and complexity of the bilingual speech recognition system. Then, discriminative training approaches, namely discriminative Gaussian training using the minimum phone error (MPE) criterion and the discriminatively trained feature transform fMPE, both of which have been shown to improve monolingual recognition performance, are adapted to the bilingual speech recognition system. A new method is applied to generate significantly better lattices for training the bilingual model, and complementary discriminative training methods are also explored to obtain the best ROVER performance in the bilingual setting. Experimental results show that the STA phone clustering method outperforms other existing phone clustering methods. Furthermore, both forms of discriminative training reduce the word error rate of the multilingual system, and combining complementary discriminative training methods improves performance significantly.
Keywords :
computational linguistics; natural language processing; speech recognition; vocabulary; MPE based discriminative training; Mandarin-English bilingual phone modeling; ROVER performance; STA phone clustering; automatic multilingual speech recognition; bilingual acoustic models; cross-language speech recognition; minimum phone error; monolingual recognition performance; state-time-alignment method; bilingual speech recognition; discriminative training; phone clustering; system combination;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-6244-5
Type :
conf
DOI :
10.1109/ISCSLP.2010.5684841
Filename :
5684841