DocumentCode :
1565272
Title :
Recognizing Transliterated Names from Chinese Texts Based on Support Vector Machines and Rules
Author :
Li, Lishuang ; Mao, Tingting ; Huang, Degen ; Li, Lihua
Author_Institution :
Dept. of Comput. Sci. & Eng., Dalian Univ. of Technol.
Volume :
2
fYear :
2005
Firstpage :
1135
Lastpage :
1138
Abstract :
According to the characteristics of transliterated names in Chinese texts, a method of automatic recognition of Chinese transliterated names combining support vector machines (SVMs) with rules is proposed. The attributes of feature vectors based on characters are extracted. A training set is established and the machine learning models of automatic identification of transliterated names are obtained by testing polynomial Kernel functions; the knowledge cannot be acquired completely if we only use the machine learning model, which will affect the recall. Through careful error analysis, the base of recognition-rules is constructed as post-processing steps to overcome the shortcoming of machine learning model. The results show that the method is efficient for identifying transliterated names from Chinese texts
Keywords :
character recognition; error analysis; learning (artificial intelligence); natural languages; support vector machines; Chinese texts; error analysis; machine learning models; polynomial Kernel functions; post-processing steps; support vector machines; transliterated names recognition; Character recognition; Computer science; Error analysis; Kernel; Machine learning; Machine learning algorithms; Support vector machine classification; Support vector machines; Testing; Text recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks and Brain, 2005. ICNN&B '05. International Conference on
Conference_Location :
Beijing
Print_ISBN :
0-7803-9422-4
Type :
conf
DOI :
10.1109/ICNNB.2005.1614816
Filename :
1614816
Link To Document :
بازگشت