DocumentCode :
3726586
Title :
Genetic Clustering Algorithm for Extractive Text Summarization
Author :
Sebastian Suarez Benjumea; Le?n
Author_Institution :
Nat. Univ. of Colombia, Bogota, Colombia
fYear :
2015
Firstpage :
949
Lastpage :
956
Abstract :
Automatic text summarization has become a relevant topic due to the information overload. This automatization aims to help humans and machines to deal with the vast amount of text data (structured and un-structured) offered on the web and deep web. In this paper a novel approach for automatic extractive text summarization called SENCLUS is presented. Using a genetic clustering algorithm, SENCLUS clusters the sentences as close representation of the text topics using a fitness function based on redundancy and coverage, and applies a scoring function to select the most relevant sentences of each topic to be part of the extractive summary. The approach was validated using the DUC2002 data set and ROUGE summary quality measures. The results shows that the approach is representative against the state of the art methods for extractive automatic text summarization.
Keywords :
"Genetics","Clustering algorithms","Genetic algorithms","Optimization","Writing","Redundancy","Approximation methods"
Publisher :
ieee
Conference_Titel :
Computational Intelligence, 2015 IEEE Symposium Series on
Print_ISBN :
978-1-4799-7560-0
Type :
conf
DOI :
10.1109/SSCI.2015.139
Filename :
7376714
Link To Document :
بازگشت