Title :
The Research on the Application of Text Clustering and Natural Language Understanding in Automatic Abstracting
Author :
Guo, Qinglin ; Li, Cunbin
Author_Institution :
North China Electr. Power Univ., Beijing
Abstract :
A method of realization of automatic abstracting based on text clustering and natural language understanding is brought forward, aimed at overcoming shortages of some current methods. The method makes use of text clustering and can realize automatic abstracting of multi-documents. The algorithm of twice word segmentation based on the title and first-sentences in paragraphs is brought forward. Its precision and recall is above 95% for a specific domain on plastics, an automatic abstracting system named TCAAS is implemented. The precision and recall of multi-document´s automatic abstracting is above 75% And experiments do prove that it is feasible to use the method to develop a domain automatic abstracting system, which is valuable for further study in more depth.
Keywords :
abstracting; natural languages; text analysis; multidocuments automatic abstracting; natural language understanding; text clustering; word segmentation; Application software; Automatic logic units; Clustering algorithms; Computer science; Databases; Dictionaries; Natural languages; Plastics; Tagging; Web server;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
Conference_Location :
Haikou
Print_ISBN :
978-0-7695-2874-8
DOI :
10.1109/FSKD.2007.584