Title :
Semantic Summarization of Web Documents
Author :
Acierno, A.D. ; Moscato, V. ; Persia, F. ; Picariello, A. ; Penta, A.
Author_Institution :
ISA-CNR, Avellino, Italy
Abstract :
Documents´ summarization techniques automatically extract relevant information from different sources with respect to a list of topics: they can be profitably used by a variety of applications and in particular for automatic indexing and categorization in order to facilitate the production and delivery of new multimedia contents. In this paper we propose a novel approach for summarizing documents retrieved from the Internet: we propose to capture the semantic nature of a document, expressed in natural language, in order to retrieve a number of RDF triplets and to clusterize these ones aggregating similar information. An overview of the system and some preliminary results are described.
Keywords :
Internet; document handling; indexing; multimedia computing; natural languages; Internet; RDF triplets; Web documents; automatic categorization; automatic indexing; documents summarization; multimedia contents; natural language; semantic summarization; Buildings; Cities and towns; Computer crashes; HTML; Resource description framework; Semantics; Terrorism;
Conference_Titel :
Semantic Computing (ICSC), 2010 IEEE Fourth International Conference on
Conference_Location :
Pittsburgh, PA
Print_ISBN :
978-1-4244-7912-2
Electronic_ISBN :
978-0-7695-4154-9
DOI :
10.1109/ICSC.2010.28