• DocumentCode
    2743601
  • Title

    An Automatic Text Summarization Approach using Content-Based and Graph-Based Characteristics

  • Author

    Sornil, Ohm ; Gree-ut, Kornnika

  • Author_Institution
    Dept. of Comput. Sci., National Inst. of Dev. Adm., Bangkok
  • fYear
    2006
  • fDate
    7-9 June 2006
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    The continuing growth of World Wide Web and on-line text collections makes a large volume of information available to users. Automatic text summarization allows users to quickly understand documents. In this paper, we propose an automated technique for single document summarization which combines content-based and graph-based approaches and introduce the Hopfield network algorithm as a technique for ranking text segments. A series of experiments are performed using the DUC collection and a Thai-document collection. The results show the superiority of the proposed technique over reference systems, in addition the Hopfield network algorithm on undirected graph is shown to be the best text segment ranking algorithm in the study
  • Keywords
    Hopfield neural nets; abstracting; graph theory; text analysis; DUC collection; Hopfield network algorithm; Thai-document collection; World Wide Web; automatic text summarization; content-based approach; content-based characteristics; document summarization; graph ranking; graph-based approach; graph-based characteristics; on-line text collections; text segment ranking algorithm; undirected graph; Computer science; Data mining; Hopfield neural networks; Matrix decomposition; Natural languages; Neural networks; Position measurement; Singular value decomposition; Volume measurement; Web sites; Hopfield Network algorithm; graph ranking; text summarization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cybernetics and Intelligent Systems, 2006 IEEE Conference on
  • Conference_Location
    Bangkok
  • Print_ISBN
    1-4244-0023-6
  • Type

    conf

  • DOI
    10.1109/ICCIS.2006.252361
  • Filename
    4017920