• DocumentCode
    2181259
  • Title

    An Automatic Thai Text Summarization Using Topic Sensitive PageRank

  • Author

    Chongsuntornsri, Aekkasit ; Sornil, Ohm

  • Author_Institution
    Dept. of Comput. Sci., Nat. Inst. of Dev. Adm., Bangkok
  • fYear
    2006
  • fDate
    Oct. 18 2006-Sept. 20 2006
  • Firstpage
    547
  • Lastpage
    552
  • Abstract
    The continuing growth of World Wide Web and on-line text collections makes a large volume of information available to users. Automatic text summarization allows users to quickly understand documents. In this paper, we propose an automated technique for single document summary extraction in Thai language which combines content-based and graph-based features and introduce the Topic Sensitive PageRank algorithm as a technique for ranking text segments. A series of experiments are performed using a Thai document collection. The results show the superiority of the proposed technique over reference systems
  • Keywords
    feature extraction; graph theory; text analysis; Thai document collection; Thai language; World Wide Web; automatic Thai text summarization; content-based features; graph-based features; on-line text collections; ranking text segments; single document summary extraction; topic sensitive pagerank; Classification tree analysis; Computer science; Context modeling; Data mining; Matrix decomposition; Natural languages; Position measurement; Probability; Supervised learning; Web sites; Thai Text Summarization; Topic Sensitive PageRank;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications and Information Technologies, 2006. ISCIT '06. International Symposium on
  • Conference_Location
    Bangkok
  • Print_ISBN
    0-7803-9741-X
  • Electronic_ISBN
    0-7803-9741-X
  • Type

    conf

  • DOI
    10.1109/ISCIT.2006.340009
  • Filename
    4141445