DocumentCode
3632601
Title
Document Visualization Based on Semantic Graphs
Author
Delia Rusu;Blaž Fortuna;Dunja Mladenić;Marko Grobelnik;Ruben Sipoš
Author_Institution
Jozef Stefan Inst., Ljubljana, Slovenia
fYear
2009
Firstpage
292
Lastpage
297
Abstract
In this paper, we present a document visualization technique for data analysis based on the semantic representation of text in the form of a directed graph, referred to as a semantic graph. The graph is derived using natural language processing as follows. First, subject-verb-object triplets are automatically extracted from the Penn Treebank parse tree obtained for each sentence in the document. Second, the triplets are further enhanced by linking them to their corresponding co-referenced named entity, by resolving pronominal anaphors, and by attaching the associated WordNet synset. Starting from the document's semantic graph and the list of extracted triplets, we automatically generate the document summary, for which we also derive the semantic representation.
Keywords
"Data visualization","Tree graphs","Joining processes","Data analysis","Data mining","Ontologies","Performance analysis","Natural language processing","Merging","Feature extraction"
Publisher
IEEE
Conference_Titel
Information Visualisation, 2009 13th International Conference
ISSN
1550-6037
Print_ISBN
978-0-7695-3733-7
Type
conf
DOI
10.1109/IV.2009.57
Filename
5190878