DocumentCode :
405640
Title :
Automatic language identification based on GMBM-UBBM
Author :
Qu, Dan ; Wang, Bingxi
Author_Institution :
Inst. of Inf. Eng. of Inf. Eng. Univ., Zhengzhou, China
fYear :
2003
fDate :
26-29 Oct. 2003
Firstpage :
722
Lastpage :
727
Abstract :
Gaussian mixture model is an effective method for speaker-independent language identification tasks. Gaussian mixture bigram model integrates bigram time correlation to extend the GMM. A language identification algorithm of GMBM-UBBM is proposed based on GMBM and GMM-UBM and some experiments are conducted using OGI-TS multilanguage telephone speech corpus. Simulation results demonstrate the effectiveness of GMBM-UBBM for language identification tasks and use of this model allows the proposed system to distinguish between the three languages with maximal 4.378% improvement accuracy superior to conventional GMM-UBM.
Keywords :
Gaussian processes; natural languages; speaker recognition; GMBM-UBBM; Gaussian mixture bigram model; OGI-TS multilanguage telephone speech corpus; bigram time correlation; language identification algorithm; speaker-independent language; Banking; Emergency services; Information retrieval; Internet; Natural languages; Probability density function; Speech; Statistics; Telephony; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
Conference_Location :
Beijing, China
Print_ISBN :
0-7803-7902-0
Type :
conf
DOI :
10.1109/NLPKE.2003.1276000
Filename :
1276000
Link To Document :
بازگشت