Title :
Research on the System of Jointing Chinese Word Segmentation with Part-of-Speech Tagging
Author :
Qin Li ; Wei Wei
Author_Institution :
LMIB, Beihang Univ., Beijing, China
Abstract :
In this paper, we construct a system that integrates Chinese word segmentation with part-of-speech (POS) tagging, using a hybrid approach based on a dictionary and statistics. In the first stage, the input is roughly segmented into many nodes by searching a word dictionary, and these nodes are used to generate possible paths as candidates, instead of choosing only the N-shortest paths. In the next stage, each candidate path is assigned a cost, which is calculated by a statistical method. With improved precision on combinational ambiguity, the optimum path, the one with the lowest cost, is chosen as the final result. Preliminary experiments show that the joint system based on the hybrid approach achieves a segmentation precision of 94.06% and a POS tagging precision of 90.96%, with recall and F-measure of 96.86% and 95.44% for segmentation and 93.67% and 92.29% for POS tagging, respectively. Work on improving the performance of the system is still ongoing.
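The two-stage idea in the abstract (dictionary lookup to generate candidate paths, then a statistical cost to pick the cheapest one) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the toy dictionary `WORD_COUNTS`, its counts, the fixed unknown-word penalty, and the unigram negative-log-probability cost are hypothetical and are not taken from the paper, whose actual cost function is not given in the abstract. The sketch also covers segmentation only, not the joint POS tagging.

```python
import math

# Hypothetical toy dictionary with made-up unigram counts (not from the paper).
WORD_COUNTS = {
    "研究": 50, "研究生": 20, "生命": 30, "命": 5, "生": 8,
    "起源": 15, "的": 200, "研": 1, "究": 1, "起": 3, "源": 2,
}
TOTAL = sum(WORD_COUNTS.values())
MAX_WORD_LEN = max(len(w) for w in WORD_COUNTS)
UNKNOWN_COST = 20.0  # assumed fixed penalty for characters missing from the dictionary


def word_cost(word):
    """Assumed cost: negative log of the word's unigram probability."""
    count = WORD_COUNTS.get(word)
    if count is None:
        return UNKNOWN_COST
    return -math.log(count / TOTAL)


def candidate_paths(sentence):
    """Stage 1: enumerate segmentations built from dictionary words.

    Single characters are always allowed so that every sentence has a path.
    """
    if not sentence:
        return [[]]
    paths = []
    for end in range(1, min(len(sentence), MAX_WORD_LEN) + 1):
        word = sentence[:end]
        if end == 1 or word in WORD_COUNTS:
            for rest in candidate_paths(sentence[end:]):
                paths.append([word] + rest)
    return paths


def path_cost(path):
    """Stage 2: the cost of a path is the sum of its word costs."""
    return sum(word_cost(w) for w in path)


def best_path(sentence):
    """Return the candidate path with the lowest total cost."""
    return min(candidate_paths(sentence), key=path_cost)


if __name__ == "__main__":
    print(best_path("研究生命的起源"))
```

With these toy counts, the string "研究生命的起源" has competing candidates such as 研究/生命/的/起源 and 研究生/命/的/起源, and the first has the lower cost. Exhaustive enumeration is exponential in sentence length; a practical system would score paths over a word lattice with dynamic programming instead.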
Keywords :
combinatorial mathematics; dictionaries; natural language processing; statistical analysis; Chinese word segmentation; F-measure range; POS tagging precision; approach based dictionary; combinational ambiguity; hybrid approach; joint system; optimum path; part-of-speech tagging; segmentation precision; statistical method; word dictionary; Computers; Dictionaries; Educational institutions; Graphical models; Hidden Markov models; Joints; Tagging; dictionary and statistics; word segmentation
Conference_Title :
Computational Intelligence and Design (ISCID), 2013 Sixth International Symposium on
Conference_Location :
Hangzhou
DOI :
10.1109/ISCID.2013.103