DocumentCode :
3632601
Title :
Document Visualization Based on Semantic Graphs
Author :
Delia Rusu;Blaž Fortuna;Dunja Mladenic;Marko Grobelnik;Ruben Sipoš
Author_Institution :
Jozef Stefan Inst., Ljubljana, Slovenia
fYear :
2009
Firstpage :
292
Lastpage :
297
Abstract :
In this paper, we present a document visualization technique for data analysis based on the semantic representation of text in the form of a directed graph, referred to as a semantic graph. The graph is derived using natural language processing as follows. First, subject-verb-object triplets are automatically extracted from the Penn Treebank parse tree obtained for each sentence in the document. Second, the triplets are enhanced by linking them to their corresponding co-referenced named entities, resolving pronominal anaphors, and attaching the associated WordNet synsets. Starting from the document's semantic graph and the list of extracted triplets, we automatically generate the document summary, for which we also derive the semantic representation.
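As an illustration of the graph-building step only, the following minimal sketch assumes the subject-verb-object triplets have already been extracted and normalized; it is not the authors' implementation. The function name `build_semantic_graph` and the sample triplets are hypothetical, and parsing, co-reference resolution, anaphora resolution, and WordNet linking are all omitted.

```python
from collections import defaultdict


def build_semantic_graph(triplets):
    """Build a directed graph from (subject, verb, object) triplets.

    Nodes are subjects and objects; each edge runs from subject to
    object and is labelled with the verb. Duplicate edges are skipped.
    """
    graph = defaultdict(list)
    for subject, verb, obj in triplets:
        edge = (verb, obj)
        if edge not in graph[subject]:
            graph[subject].append(edge)
        # Register the object as a node even if it has no outgoing edges.
        graph.setdefault(obj, [])
    return dict(graph)


# Hypothetical, already-normalized triplets (not taken from the paper).
triplets = [
    ("Tom", "own", "dog"),
    ("dog", "chase", "cat"),
    ("Tom", "own", "dog"),  # duplicate, ignored
]

graph = build_semantic_graph(triplets)
for node, edges in graph.items():
    for verb, target in edges:
        print(f"{node} -[{verb}]-> {target}")
```

An adjacency map keeps the sketch free of dependencies; a graph library would be a reasonable substitute if layout or rendering is needed.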
Keywords :
"Data visualization","Tree graphs","Joining processes","Data analysis","Data mining","Ontologies","Performance analysis","Natural language processing","Merging","Feature extraction"
Publisher :
IEEE
Conference_Title :
Information Visualisation, 2009 13th International Conference
ISSN :
1550-6037
Print_ISBN :
978-0-7695-3733-7
Type :
conf
DOI :
10.1109/IV.2009.57
Filename :
5190878