DocumentCode :
3520948
Title :
An Efficient Corpus Based Part-of-Speech Tagging with GEP
Author :
Lv, Chengyao ; Liu, Huihua ; Dong, Yuanxing
Author_Institution :
Sch. of Foreign Language, China Univ. of Geosci., Wuhan, China
fYear :
2010
fDate :
1-3 Nov. 2010
Firstpage :
289
Lastpage :
292
Abstract :
Text corpora which are tagged with part-of-speech (pos) information are useful in many areas of linguistic research. This paper proposes a model of Genetic Expression Programming (GEP) for pos tagging. GEP is used to search for appropriate structures in function space. After the evolution of sequence of tags, GEP can find the best individual as solution. Before simulation, a set of appropriate parameters of algorithm is fitted. Experiments on Brown Corpus show that the proposed model can achieve higher accuracy rate than Genetic Algorithm model and HMM model.
Keywords :
identification technology; natural language processing; optimisation; search problems; text analysis; Brown corpus; GEP; corpus based part-of-speech tagging; genetic expression programming; part-of-speech information; pos tagging; text corpora;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Semantics Knowledge and Grid (SKG), 2010 Sixth International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-8125-5
Electronic_ISBN :
978-0-7695-4189-1
Type :
conf
DOI :
10.1109/SKG.2010.42
Filename :
5663526
Link To Document :
بازگشت