Title :
The Adaptation Schemes In PR-SVM Based Language Recognition
Author :
Xu Bing ; Song Yan ; Lirong Dai
Author_Institution :
iFlytek Speech Lab., Univ. of Sci. & Technol. of China, China
Abstract :
Phonetic-based systems usually convert the input speech into token (i.e. word, phone etc.) sequence and determine the target language from the statistics of the token sequences on different languages. Generally, there are two kinds of statistical representation for token sequences, N-gram language model (PR-LM) and support vector machines (PR- SVM) to perform language classification. In this paper we focus on PR-SVM method. One problem of the PR-SVM is that the statistical representation based on utterance is sparse and inaccurate. To tackle this issue, the adaptation schemes in PR-SVM framework are proposed in this paper. There are two schemes to be used: 1) Adaptation from the Universal N-gram Language Model (UNLM) trained on all languages; 2) Adaptation from the Low-Order N-gram Language Model (LONLM). The experimental results on 2007 NIST LRE tasks show that our method achieves significant gains over the unadapted model.
Keywords :
natural language processing; speech recognition; support vector machines; PR-SVM; adaptation schemes; language recognition; low-order n-gram language model; phonetic-based systems; support vector machines; universal n-gram language model; Flowcharts; Hidden Markov models; Kernel; Lattices; NIST; Natural languages; Speech recognition; Statistics; Support vector machine classification; Support vector machines;
Conference_Titel :
Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-2942-4
Electronic_ISBN :
978-1-4244-2943-1
DOI :
10.1109/CHINSL.2008.ECP.95