Title :
Study of Word-Based Chinese Document Experimental System and Chinese Free-Text Information Extraction Experiment Based on It
Author :
Liu, Qian ; Jiao, Hui ; Jia, Hui-bo
Author_Institution :
State Key Lab. of Precision Meas. Technol. & Instrum., Beijing
Abstract :
This paper presents a word-based Chinese document experimental system which is aimed to make Chinese information processing technology to develop on a more reliable and more efficient basis. This system implements the document storage and processing format, both of which are based on the smallest information carrier: Chinese word. Further an IE algorithm with two steps strategy for the Chinese free text is introduced. And then taking this document system as experimental platform, choosing the abstract part of Chinese Sci_Tech journals as the free text, the IE experiment which is conducted and get good results: accuracy ratio P is 95.03%, recall ratio R is 91.40% and F-value is 93.18% From the experimental results, we can see that the Word-based Chinese Document System designed by us can promote the development of Chinese Information Processing technology to more advanced application stages.
Keywords :
information retrieval; natural language processing; text analysis; Chinese free-text information extraction experiment; Chinese information processing technology; word-based Chinese document experimental system; Algorithm design and analysis; Data mining; Information processing; Instruments; Knowledge engineering; Laboratories; Machine learning; Machine learning algorithms; Resumes; Training data;
Conference_Titel :
Natural Computation, 2007. ICNC 2007. Third International Conference on
Conference_Location :
Haikou
Print_ISBN :
978-0-7695-2875-5
DOI :
10.1109/ICNC.2007.688