Title :
An Automatic Thai Text Summarization Using Topic Sensitive PageRank
Author :
Chongsuntornsri, Aekkasit ; Sornil, Ohm
Author_Institution :
Dept. of Comput. Sci., Nat. Inst. of Dev. Adm., Bangkok
fDate :
Oct. 18 2006-Oct. 20 2006
Abstract :
The continuing growth of the World Wide Web and on-line text collections makes a large volume of information available to users. Automatic text summarization allows users to quickly understand documents. In this paper, we propose an automated technique for single-document summary extraction in the Thai language that combines content-based and graph-based features, and we introduce the Topic Sensitive PageRank algorithm as a technique for ranking text segments. A series of experiments is performed using a Thai document collection. The results show the superiority of the proposed technique over reference systems.
Keywords :
feature extraction; graph theory; text analysis; Thai document collection; Thai language; World Wide Web; automatic Thai text summarization; content-based features; graph-based features; on-line text collections; ranking text segments; single document summary extraction; topic sensitive pagerank; Classification tree analysis; Computer science; Context modeling; Data mining; Matrix decomposition; Natural languages; Position measurement; Probability; Supervised learning; Web sites; Thai Text Summarization; Topic Sensitive PageRank;
Conference_Titel :
Communications and Information Technologies, 2006. ISCIT '06. International Symposium on
Conference_Location :
Bangkok
Print_ISBN :
0-7803-9741-X
Electronic_ISBN :
0-7803-9741-X
DOI :
10.1109/ISCIT.2006.340009