Title :
A Hierarchical System Design for Language Identification
Author :
Wang, Haipeng ; Xiao, Xiang ; Zhang, Xiang ; Zhang, Jianping ; Yan, Yonghong
Author_Institution :
ThinkIT Speech Lab., Chinese Acad. of Sci., Beijing, China
Abstract :
Token-based approaches have proven quite effective for spoken language identification (LID). Traditionally, Speech utterances are first decoded into token sequences, and then LID tasks are performed on these token sequences by either n-gram language models or support vector machines. In this paper, we propose a hierarchical system design, which utilizes a group of bayesian logistic regression models as score generators. Score generators are then followed by a score merger, which outputs the final identification results. Experiments conducted on the NISR LRE 2007 databases demonstrate that the proposed approach achieves quite competitive performance compared to other traditional token-based methods.
Keywords :
Bayes methods; natural language processing; regression analysis; speech processing; support vector machines; LID tasks; NISR LRE 2007 databases; bayesian logistic regression models; hierarchical system design; n-gram language models; score generators; score merger; speech utterances; spoken language identification; support vector machines; token-based approach; Bayesian methods; Corporate acquisitions; Databases; Decoding; Hierarchical systems; Logistics; Natural languages; Speech; Support vector machine classification; Support vector machines; bayesian logistic regression model; hierarchical system design; language identification;
Conference_Titel :
Information Science and Engineering (ISISE), 2009 Second International Symposium on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-6325-1
Electronic_ISBN :
978-1-4244-6326-8
DOI :
10.1109/ISISE.2009.102