DocumentCode
2425668
Title
The Research on the Application of Text Clustering and Natural Language Understanding in Automatic Abstracting
Author
Guo, Qinglin ; Li, Cunbin
Author_Institution
North China Electr. Power Univ., Beijing
Volume
4
fYear
2007
fDate
24-27 Aug. 2007
Firstpage
92
Lastpage
96
Abstract
A method of realization of automatic abstracting based on text clustering and natural language understanding is brought forward, aimed at overcoming shortages of some current methods. The method makes use of text clustering and can realize automatic abstracting of multi-documents. The algorithm of twice word segmentation based on the title and first-sentences in paragraphs is brought forward. Its precision and recall is above 95% for a specific domain on plastics, an automatic abstracting system named TCAAS is implemented. The precision and recall of multi-document´s automatic abstracting is above 75% And experiments do prove that it is feasible to use the method to develop a domain automatic abstracting system, which is valuable for further study in more depth.
Keywords
abstracting; natural languages; text analysis; multidocuments automatic abstracting; natural language understanding; text clustering; word segmentation; Application software; Automatic logic units; Clustering algorithms; Computer science; Databases; Dictionaries; Natural languages; Plastics; Tagging; Web server;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
Conference_Location
Haikou
Print_ISBN
978-0-7695-2874-8
Type
conf
DOI
10.1109/FSKD.2007.584
Filename
4406360
Link To Document