• DocumentCode
    2425668
  • Title
    The Research on the Application of Text Clustering and Natural Language Understanding in Automatic Abstracting
  • Author
    Guo, Qinglin ; Li, Cunbin
  • Author_Institution
    North China Electr. Power Univ., Beijing
  • Volume
    4
  • fYear
    2007
  • fDate
    24-27 Aug. 2007
  • Firstpage
    92
  • Lastpage
    96
  • Abstract
    A method for automatic abstracting based on text clustering and natural language understanding is proposed, aimed at overcoming the shortcomings of some current methods. The method makes use of text clustering and can perform automatic abstracting of multiple documents. An algorithm of twice word segmentation, based on the title and the first sentences of paragraphs, is also proposed; its precision and recall are above 95%. For a specific domain, plastics, an automatic abstracting system named TCAAS is implemented. The precision and recall of multi-document automatic abstracting are above 75%. Experiments show that it is feasible to use the method to develop a domain-specific automatic abstracting system, and that the method merits further, more in-depth study.
  • Keywords
    abstracting; natural languages; text analysis; multidocuments automatic abstracting; natural language understanding; text clustering; word segmentation; Application software; Automatic logic units; Clustering algorithms; Computer science; Databases; Dictionaries; Natural languages; Plastics; Tagging; Web server;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
  • Conference_Location
    Haikou
  • Print_ISBN
    978-0-7695-2874-8
  • Type
    conf
  • DOI
    10.1109/FSKD.2007.584
  • Filename
    4406360