Title :
Language identification based on improved training and classification
Author :
Partovi, Elaheh ; Ahadi, Seyed Mohammad ; Faraji, Neda
Author_Institution :
Electr. Eng. Dept., Amirkabir Univ. of Technol., Tehran, Iran
Abstract :
Language identification is an automatic process of detecting the language of a speech utterance. As an application example, in automatic translation technology, before any recognition or translation, the spoken language must be recognized. In this paper we propose two methods for training and testing the language classifiers. One uses Kullback Leibler Divergence (KLD) for improved training of GMMs and the other is the use of Frame Selection Decoding (FSD) for classification. The resulting system leads to significant improvement over the baseline system. Here, acoustic features are extracted directly from speech, and in order to add temporal variations, delta and shifted delta cepstral parameters are added to the features. Our approach has led to a language identification performance of 78.6% among 11 languages using the OGI database and relative reduction error rate of 27.95% when compared with a baseline system employing GMM-UBM for classification.
Keywords :
Gaussian processes; cepstral analysis; classification; error statistics; feature extraction; language translation; mixture models; speech recognition; FSD; GMM-UBM; KLD; Kullback Leibler Divergence; OGI database; automatic translation technology; delta cepstral parameter; feature extraction; frame selection decoding; language classifier; language identification performance; relative reduction error rate; speech utterance; spoken language; Conferences; Electrical engineering; Frame Selection Decoding; GMM-UBM; Kullback Leibler Divergence; Language Identification; Shifted Delta Cepstrum; feature warping;
Conference_Titel :
Electrical Engineering (ICEE), 2015 23rd Iranian Conference on
Conference_Location :
Tehran
Print_ISBN :
978-1-4799-1971-0
DOI :
10.1109/IranianCEE.2015.7146206