DocumentCode
2181259
Title
An Automatic Thai Text Summarization Using Topic Sensitive PageRank
Author
Chongsuntornsri, Aekkasit ; Sornil, Ohm
Author_Institution
Dept. of Comput. Sci., Nat. Inst. of Dev. Adm., Bangkok
fYear
2006
fDate
Oct. 18 2006-Sept. 20 2006
Firstpage
547
Lastpage
552
Abstract
The continuing growth of World Wide Web and on-line text collections makes a large volume of information available to users. Automatic text summarization allows users to quickly understand documents. In this paper, we propose an automated technique for single document summary extraction in Thai language which combines content-based and graph-based features and introduce the Topic Sensitive PageRank algorithm as a technique for ranking text segments. A series of experiments are performed using a Thai document collection. The results show the superiority of the proposed technique over reference systems
Keywords
feature extraction; graph theory; text analysis; Thai document collection; Thai language; World Wide Web; automatic Thai text summarization; content-based features; graph-based features; on-line text collections; ranking text segments; single document summary extraction; topic sensitive pagerank; Classification tree analysis; Computer science; Context modeling; Data mining; Matrix decomposition; Natural languages; Position measurement; Probability; Supervised learning; Web sites; Thai Text Summarization; Topic Sensitive PageRank;
fLanguage
English
Publisher
ieee
Conference_Titel
Communications and Information Technologies, 2006. ISCIT '06. International Symposium on
Conference_Location
Bangkok
Print_ISBN
0-7803-9741-X
Electronic_ISBN
0-7803-9741-X
Type
conf
DOI
10.1109/ISCIT.2006.340009
Filename
4141445
Link To Document