DocumentCode :
570232
Title :
Combining a double clustering approach with sentence simplification to produce highly informative multi-document summaries
Author :
Silveira, Sara Botelho ; Branco, António
Author_Institution :
Dept. de Inf., Univ. of Lisbon, Lisbon, Portugal
fYear :
2012
fDate :
8-10 Aug. 2012
Firstpage :
482
Lastpage :
489
Abstract :
This paper presents a method for extractive multi-document summarization that combines a two-phase clustering approach with a sentence simplification procedure to generate more useful summaries. First, sentences are clustered by similarity, and one sentence per cluster is selected to reduce redundancy. Then, to group them by topic, the selected sentences are clustered based on the collection of keywords. Finally, the summarization process includes a sentence simplification step, which aims not only to create simpler and more incisive sentences, but also to make room for further relevant content in the summary. Evaluation reveals that the approach produces highly informative summaries that contain relevant data and no repeated information.
Keywords :
pattern clustering; text analysis; double clustering approach; multidocument summarization; sentence simplification; two-phase clustering approach; Abstracts; Clustering algorithms; Humans; Measurement; Organizations; Pragmatics; Redundancy
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Information Reuse and Integration (IRI), 2012 IEEE 13th International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4673-2282-9
Electronic_ISBN :
978-1-4673-2283-6
Type :
conf
DOI :
10.1109/IRI.2012.6303047
Filename :
6303047