DocumentCode :
2181259
Title :
An Automatic Thai Text Summarization Using Topic Sensitive PageRank
Author :
Chongsuntornsri, Aekkasit ; Sornil, Ohm
Author_Institution :
Dept. of Comput. Sci., Nat. Inst. of Dev. Adm., Bangkok
fYear :
2006
fDate :
Oct. 18 2006-Sept. 20 2006
Firstpage :
547
Lastpage :
552
Abstract :
The continuing growth of World Wide Web and on-line text collections makes a large volume of information available to users. Automatic text summarization allows users to quickly understand documents. In this paper, we propose an automated technique for single document summary extraction in Thai language which combines content-based and graph-based features and introduce the Topic Sensitive PageRank algorithm as a technique for ranking text segments. A series of experiments are performed using a Thai document collection. The results show the superiority of the proposed technique over reference systems
Keywords :
feature extraction; graph theory; text analysis; Thai document collection; Thai language; World Wide Web; automatic Thai text summarization; content-based features; graph-based features; on-line text collections; ranking text segments; single document summary extraction; topic sensitive pagerank; Classification tree analysis; Computer science; Context modeling; Data mining; Matrix decomposition; Natural languages; Position measurement; Probability; Supervised learning; Web sites; Thai Text Summarization; Topic Sensitive PageRank;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications and Information Technologies, 2006. ISCIT '06. International Symposium on
Conference_Location :
Bangkok
Print_ISBN :
0-7803-9741-X
Electronic_ISBN :
0-7803-9741-X
Type :
conf
DOI :
10.1109/ISCIT.2006.340009
Filename :
4141445
Link To Document :
بازگشت