DocumentCode
1987148
Title
Research on the System of Jointing Chinese Word Segmentation with Part-of-Speech Tagging
Author
Qin Li ; Wei Wei
Author_Institution
LMIB, Beihang Univ., Beijing, China
Volume
1
fYear
2013
fDate
28-29 Oct. 2013
Firstpage
387
Lastpage
390
Abstract
In this paper, we construct a system integrating Chinese word segmentation with part-of-speech tagging, by an approach based dictionary and statistics. In the early stage, many nodes are roughly segmented through searching word dictionary and used to generate possible paths as candidates, instead of choosing N-shortest paths. In the next stage, each path generated above has a cost, which is calculated by a statistical method. With improving the precision of combinational ambiguity, the optimum path that has lowest cost is chosen as the final result. The preliminary experiments show that the segmentation precision of the joint system based on hybrid approach is 94.06%, POS tagging precision 90.96%, and the recall and F-measure range from 96.86% to 95.44.0% and from 93.67% to 92.29% respectively. The Work of improving the performance of the system is still ongoing.
Keywords
combinatorial mathematics; dictionaries; natural language processing; statistical analysis; Chinese word segmentation; F-measure range; POS tagging precision; approach based dictionary; combinational ambiguity; hybrid approach; joint system; optimum path; part-of-speech tagging; segmentation precision; statistical method; word dictionary; Computers; Dictionaries; Educational institutions; Graphical models; Hidden Markov models; Joints; Tagging; combinational ambiguity; dictionary and statistics; part-of-speech tagging; word segmentation;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence and Design (ISCID), 2013 Sixth International Symposium on
Conference_Location
Hangzhou
Type
conf
DOI
10.1109/ISCID.2013.103
Filename
6805016
Link To Document