Title :
Channel-dependent GMM and Multi-class Logistic Regression models for language recognition
Author :
Van Leeuwen, David A. ; Brümmer, Niko
Author_Institution :
TNO Human Factors, Soesterberg
Abstract :
This paper describes two new approaches to spoken language recognition. These were both successfully applied in the NIST 2005 Language Recognition Evaluation. The first approach extends the Gaussian mixture model technique with channel dependency, which results in actual detection costs (CDET) of 0.095 in NIST LRE-2005, and which should be compared to a traditional 2-gender dependency of GMM language models achieving 0.120. The second approach is a multi-class logistic regression system, which operates similarly to a support vector machine (SVM), but can be trained for all languages simultaneously. This new approach resulted in a CDET of 0.198. The joint TNO-Spescom Datavoice (TNO-SDV) submission to NIST LRE-2005 contained two more systems and obtained a result of 0.0958
Keywords :
Gaussian channels; natural languages; regression analysis; speaker recognition; support vector machines; Gaussian mixture model technique; NIST 2005 Language Recognition Evaluation; SVM; TNO-SDV; channel-dependent GMM; joint TNO-Spescom Datavoice; multiclass logistic regression model; spoken language recognition; support vector machine; Africa; Cost function; Human factors; Logistics; Materials testing; NIST; Natural languages; Speaker recognition; Speech recognition; Support vector machines;
Conference_Titel :
Speaker and Language Recognition Workshop, 2006. IEEE Odyssey 2006: The
Conference_Location :
San Juan
Print_ISBN :
1-424400471-1
Electronic_ISBN :
1-4244-0472-X
DOI :
10.1109/ODYSSEY.2006.248094