Title :
An Efficient Corpus Based Part-of-Speech Tagging with GEP
Author :
Lv, Chengyao ; Liu, Huihua ; Dong, Yuanxing
Author_Institution :
Sch. of Foreign Language, China Univ. of Geosci., Wuhan, China
Abstract :
Text corpora which are tagged with part-of-speech (pos) information are useful in many areas of linguistic research. This paper proposes a model of Genetic Expression Programming (GEP) for pos tagging. GEP is used to search for appropriate structures in function space. After the evolution of sequence of tags, GEP can find the best individual as solution. Before simulation, a set of appropriate parameters of algorithm is fitted. Experiments on Brown Corpus show that the proposed model can achieve higher accuracy rate than Genetic Algorithm model and HMM model.
Keywords :
identification technology; natural language processing; optimisation; search problems; text analysis; Brown corpus; GEP; corpus based part-of-speech tagging; genetic expression programming; part-of-speech information; pos tagging; text corpora;
Conference_Titel :
Semantics Knowledge and Grid (SKG), 2010 Sixth International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-8125-5
Electronic_ISBN :
978-0-7695-4189-1
DOI :
10.1109/SKG.2010.42