Title :
Spoken Language Recognition Using Ensemble Classifiers
Author :
Ma, Bin ; Li, Haizhou ; Tong, Rong
Author_Institution :
Inst. for Infocomm Res., Singapore
Abstract :
In this paper, we study a novel approach to spoken language recognition using an ensemble of binary classifiers. In this framework, we begin by representing a speech utterance with a high-dimensional feature vector such as the phonotactic characteristics or the polynomial expansion of cepstral features. A binary classifier can be built based on such feature vectors. We adopt a distributed output coding strategy in ensemble classifier design, where we decompose a multiclass language recognition problem into many binary classification tasks, each of which addresses a language recognition subtask by using a component classifier. Then, we combine the results of the component classifiers to form an output code as a hypothesized solution to the overall language recognition problem. In this way, we effectively project high-dimensional feature vectors into a tractable low-dimensional space, yet maintaining language discriminative characteristics of the spoken utterances. By fusing the output codes from both phonotactic features and cepstral features, we achieve equal-error-rates of 1.38% and 3.20% for 30-s trials on the 2003 and 2005 NIST language recognition evaluation databases.
Keywords :
natural language processing; speech coding; speech recognition; binary classifiers; component classifier; distributed output coding strategy; ensemble classifiers; equal-error-rates; high-dimensional feature vector; language discriminative characteristics; multiclass language recognition problem; polynomial expansion; speech utterance; spoken language recognition; Cepstral analysis; Globalization; Kernel; NIST; Natural languages; Polynomials; Spatial databases; Speech recognition; Statistics; Vectors; Component classifier selection; ensemble classifiers; output codes; spoken language recognition;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2007.902861