Title :
Subtopic-based multi-document summarization
Author :
Dai, Lin ; Tang, Ji-liang ; Xia, Yun-qing
Author_Institution :
Sch. of Comput. Sci., Beijing Inst. of Technol., Beijing, China
Abstract :
This paper proposes a novel approach for multi-document summarization based on subtopic segmentation. It firstly detects the subtopics in a topic, and then finds the central sentence for each subtopic. The sentences are scored based on their importance in the document and in the subtopic. Two anti-redundancy strategies are used to extract sentences to form summarization. Since our approach is intrinsically incremental, it is effective when new documents are added to the document set. Experimental results indicate that the proposed approach is effective and efficient.
Keywords :
text analysis; antiredundancy strategy; subtopic segmentation; subtopic-based multidocument summarization; Computer science; Cybernetics; Data mining; Information science; Information technology; Laboratories; Learning systems; Machine learning; Natural language processing; Partial response channels; Anti-redundancy strategy; Multi-document summarization; Topic Detection and Tracking; Topic segmentation;
Conference_Titel :
Machine Learning and Cybernetics, 2009 International Conference on
Conference_Location :
Baoding
Print_ISBN :
978-1-4244-3702-3
Electronic_ISBN :
978-1-4244-3703-0
DOI :
10.1109/ICMLC.2009.5212767