DocumentCode :
2425668
Title :
The Research on the Application of Text Clustering and Natural Language Understanding in Automatic Abstracting
Author :
Guo, Qinglin ; Li, Cunbin
Author_Institution :
North China Electr. Power Univ., Beijing
Volume :
4
fYear :
2007
fDate :
24-27 Aug. 2007
Firstpage :
92
Lastpage :
96
Abstract :
A method of realization of automatic abstracting based on text clustering and natural language understanding is brought forward, aimed at overcoming shortages of some current methods. The method makes use of text clustering and can realize automatic abstracting of multi-documents. The algorithm of twice word segmentation based on the title and first-sentences in paragraphs is brought forward. Its precision and recall is above 95% for a specific domain on plastics, an automatic abstracting system named TCAAS is implemented. The precision and recall of multi-document´s automatic abstracting is above 75% And experiments do prove that it is feasible to use the method to develop a domain automatic abstracting system, which is valuable for further study in more depth.
Keywords :
abstracting; natural languages; text analysis; multidocuments automatic abstracting; natural language understanding; text clustering; word segmentation; Application software; Automatic logic units; Clustering algorithms; Computer science; Databases; Dictionaries; Natural languages; Plastics; Tagging; Web server;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
Conference_Location :
Haikou
Print_ISBN :
978-0-7695-2874-8
Type :
conf
DOI :
10.1109/FSKD.2007.584
Filename :
4406360
Link To Document :
بازگشت