DocumentCode
3102179
Title
Subtopic-based multi-document summarization
Author
Dai, Lin ; Tang, Ji-liang ; Xia, Yun-qing
Author_Institution
Sch. of Comput. Sci., Beijing Inst. of Technol., Beijing, China
Volume
6
fYear
2009
fDate
12-15 July 2009
Firstpage
3505
Lastpage
3510
Abstract
This paper proposes a novel approach for multi-document summarization based on subtopic segmentation. It firstly detects the subtopics in a topic, and then finds the central sentence for each subtopic. The sentences are scored based on their importance in the document and in the subtopic. Two anti-redundancy strategies are used to extract sentences to form summarization. Since our approach is intrinsically incremental, it is effective when new documents are added to the document set. Experimental results indicate that the proposed approach is effective and efficient.
Keywords
text analysis; antiredundancy strategy; subtopic segmentation; subtopic-based multidocument summarization; Computer science; Cybernetics; Data mining; Information science; Information technology; Laboratories; Learning systems; Machine learning; Natural language processing; Partial response channels; Anti-redundancy strategy; Multi-document summarization; Topic Detection and Tracking; Topic segmentation;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2009 International Conference on
Conference_Location
Baoding
Print_ISBN
978-1-4244-3702-3
Electronic_ISBN
978-1-4244-3703-0
Type
conf
DOI
10.1109/ICMLC.2009.5212767
Filename
5212767
Link To Document