Title :
An Ontology-Based Approach to Text Summarization
Author :
Hennig, Leonhard ; Umbrath, Winfried ; Wetzker, Robert
Author_Institution :
DAI Labor, Tech. Univ. Berlin, Berlin
Abstract :
Extractive text summarization aims to create a condensed version of one or more source documents by selecting the most informative sentences. Research in text summarization has therefore often focused on measures of the usefulness of sentences for a summary. We present an approach to sentence extraction that maps sentences to nodes of a hierarchical ontology. By considering ontology attributes we are able to improve the semantic representation of a sentence´s information content. The classifier that maps sentences to the taxonomy is trained using search engines and is therefore very flexible and not bound to a specific domain. In our experiments, we train an SVM classifier to identify summary sentences using ontology-based sentence features. Our experimental results show that the ontology-based extraction of sentences outperforms baseline classifiers, leading to higher Rouge scores of summary extracts.
Keywords :
feature extraction; ontologies (artificial intelligence); pattern classification; search engines; support vector machines; text analysis; SVM classifier; extractive text summarization; hierarchical ontology; search engines; semantic representation; Abstracts; Data mining; Intelligent agent; Ontologies; Search engines; Support vector machine classification; Support vector machines; Taxonomy; Tornadoes; Tropical cyclones; hierarchical classification; ontology; sentence extraction; summarization;
Conference_Titel :
Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-0-7695-3496-1
DOI :
10.1109/WIIAT.2008.175