Title :
Discriminative boosting regression backend for phonotactic language recognition
Author :
Wei-Wei Liu ; Wei-Qiang Zhang ; Jia Liu
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
Abstract :
In spoken language recognition (SLR), discriminatively trained models always outperform non-discriminative models but computationally expensive and complex to implement. In this paper, we explore a novel approach to discriminative vector space model (VSM) training by using a boosting regression framework, in which an ensemble of VSMs is trained sequentially. The effectiveness of our boosting variation comes from the emphasis on working with the high confidence test data to achieve discriminatively trained models. Our variant of boosting also includes utilizing original training data in VSM training. The discriminative boosting regression (DBR) is applied to the National Institute of Standards and Technology (NIST) language recognition evaluation (LRE) 2009 task and show performance improvements. The experimental results demonstrate that the proposed DBR shows 4.13%, 14.38% and 14.22% relative reduction for 30s, 10s and 3s test utterances in equal error rate (EER) than baseline system.
Keywords :
learning (artificial intelligence); regression analysis; speech recognition; support vector machines; DBR; EER; NIST LRE task; National Institute of Standards and Technology; SLR; VSM training; boosting variation; discriminative boosting regression backend; discriminative vector space model; equal error rate; language recognition evaluation task; phonotactic language recognition; spoken language recognition; test utterance; Boosting; Databases; Distributed Bragg reflectors; Error analysis; NIST; Support vector machines; Training; discriminative boosting regression (DBR); language recognition;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location :
Singapore
DOI :
10.1109/ISCSLP.2014.6936600