Title :
Combining a double clustering approach with sentence simplification to produce highly informative multi-document summaries
Author :
Silveira, Sara Botelho ; Branco, António
Author_Institution :
Dept. de Inf., Univ. of Lisbon, Lisbon, Portugal
Abstract :
This paper presents a method for extractive multi-document summarization that explores a two-phase clustering approach that, combined with a sentence simplification procedure, aims to generate more useful summaries. First, sentences are clustered by similarity, and one sentence per cluster is selected, to reduce redundancy. Then, in order to group them according to topics, those sentences are clustered considering the collection of keywords. Finally, the summarization process includes a sentence simplification step, which aims not only to create simpler and more incisive sentences, but also to make room for the inclusion of further relevant content in the summary. Evaluation reveals that the approach pursued produces highly informative summaries, containing relevant data and no repeated information.
Keywords :
pattern clustering; text analysis; double clustering approach; multidocument summarization; sentence simplification; two-phase clustering approach; Abstracts; Clustering algorithms; Humans; Measurement; Organizations; Pragmatics; Redundancy;
Conference_Titel :
Information Reuse and Integration (IRI), 2012 IEEE 13th International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4673-2282-9
Electronic_ISBN :
978-1-4673-2283-6
DOI :
10.1109/IRI.2012.6303047