Title :
Automatic Language Identification Using the Frequencies of Occurrence of Phones
Author :
Dai, Guannan ; Wang, Bingxi ; Qu, Dan ; Zhang, Wenlin
Author_Institution :
Dept. of Inf. Sci., Inf. Eng. Univ., Zhengzhou
Abstract :
Phonetic inventories differ from language to language. Even when languages have identical phones, the frequencies of occurrence of phones differ across languages. It´s difficult to introduce new languages when the language identification system used phones label. The frequencies of occurrence of phones were trained by Gaussian mixture model and vector quantization. The method of occurring of phones and three improved methods were compared. The results show that the frequencies of occurrence of phones are effective in language identification. The performance of GMM is better than the performance of VQ. And the third modified method of joint-usefulness is better than the other methods
Keywords :
Gaussian processes; natural languages; speech processing; speech recognition; vector quantisation; Gaussian mixture model; automatic language identification; joint usefulness; phone label; phone occurrence; phonetics; vector quantization; Auditory system; Data mining; Frequency; Histograms; Humans; Information science; Natural languages; Signal processing; Speech processing; Vector quantization; Automatic Languange Identification (LID); Mixed Training Model(MTM); Occurring of Phones;
Conference_Titel :
Intelligent Control and Automation, 2006. WCICA 2006. The Sixth World Congress on
Conference_Location :
Dalian
Print_ISBN :
1-4244-0332-4
DOI :
10.1109/WCICA.2006.1713902