DocumentCode :
3102179
Title :
Subtopic-based multi-document summarization
Author :
Dai, Lin ; Tang, Ji-liang ; Xia, Yun-qing
Author_Institution :
Sch. of Comput. Sci., Beijing Inst. of Technol., Beijing, China
Volume :
6
fYear :
2009
fDate :
12-15 July 2009
Firstpage :
3505
Lastpage :
3510
Abstract :
This paper proposes a novel approach for multi-document summarization based on subtopic segmentation. It firstly detects the subtopics in a topic, and then finds the central sentence for each subtopic. The sentences are scored based on their importance in the document and in the subtopic. Two anti-redundancy strategies are used to extract sentences to form summarization. Since our approach is intrinsically incremental, it is effective when new documents are added to the document set. Experimental results indicate that the proposed approach is effective and efficient.
Keywords :
text analysis; antiredundancy strategy; subtopic segmentation; subtopic-based multidocument summarization; Computer science; Cybernetics; Data mining; Information science; Information technology; Laboratories; Learning systems; Machine learning; Natural language processing; Partial response channels; Anti-redundancy strategy; Multi-document summarization; Topic Detection and Tracking; Topic segmentation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2009 International Conference on
Conference_Location :
Baoding
Print_ISBN :
978-1-4244-3702-3
Electronic_ISBN :
978-1-4244-3703-0
Type :
conf
DOI :
10.1109/ICMLC.2009.5212767
Filename :
5212767
Link To Document :
بازگشت