Title :
The description of iFlyTek Speech Lab system for NIST2009 Language Recognition Evaluation
Author :
Xu, Ying ; Song, Yan ; Long, Yan-hua ; Zhong, Hai-Bing ; Dai, Li-Rong
Author_Institution :
iFlyTek Speech Lab., Univ. of Sci. & Technol. of China, Hefei, China
fDate :
Nov. 29 2010-Dec. 3 2010
Abstract :
In this paper, we present a description of the iFlyTek Speech Lab system for NIST 2009 LRE (Language Recognition Evaluation). The system consists of acoustic systems (i.e. GMM-MMI and GMM-SVM) and phonotactic systems (i.e. PPR 4-gram LM and PPR 3-gram SVM). First, we describe several state-of-the-art techniques applied in our language recognition system, such as FA (Factor Analysis), MMI (Maximum Mutual Information), and generative and discriminative LM (Language Modelling) techniques etc. Then, we will discuss our data preprocessing techniques for handling large amount training and development data, and the mismatch among different languages, genders and channels. Finally, the evaluation results for NIST2009´s tasks and detailed analysis are given for 30, 10 and 3 seconds durations.
Keywords :
speech recognition; LM; MMI; NIST2009 language recognition evaluation; acoustic systems; factor analysis; iFlyTek speech lab system; language modelling; language recognition system; maximum mutual information; phonotactic systems; state-of-the-art techniques; Acoustics; Adaptation model; Hidden Markov models; NIST; Speech; Support vector machines; Training; Acoustic Systems; Channel Compensation; NIST2009 LRE; Phonotactic System;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-6244-5
DOI :
10.1109/ISCSLP.2010.5684492