Title :
Ontology based semantic similarity comparison of documents
Author :
Oleshchuk, Vladimir ; Pedersen, Asle
Author_Institution :
Agder Univ. Coll., Grimstad, Norway
Abstract :
In this paper, we consider ontologies as knowledge structures that specify terms, their properties, and the relations among them, to enable knowledge extraction from texts. We represent ontologies using a graph-based model that reflects the semantic relationships between concepts, and we apply them to text analysis and comparison. Instead of comparing raw documents, we compare document footprints enhanced with concepts from the ontology (using different enhancement algorithms). As a result, documents that were not similar before the enhancement may become similar (semantically, at some abstraction level) after it. This is because the enhancement process may introduce abstract concepts from the ontology into the document footprint. Using the ontology, we can enhance the footprints by adding concepts that are not present in the original document. We may use synonyms for a horizontal expansion, broader terms, superclasses, or types for a vertical expansion, or both.
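The footprint enhancement described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the toy ontology (SYNONYMS, BROADER), the function names, the treatment of a footprint as a plain set of concepts, and the use of Jaccard overlap as the similarity measure. The paper's own enhancement algorithms and comparison measure may differ.

```python
# Hypothetical toy ontology: synonym links (horizontal) and
# broader-term links (vertical). Not taken from the paper.
SYNONYMS = {
    "car": {"automobile"},
    "automobile": {"car"},
}
BROADER = {
    "car": {"vehicle"},
    "automobile": {"vehicle"},
    "truck": {"vehicle"},
}


def expand_horizontal(footprint):
    """Return the footprint plus the synonyms of each of its concepts."""
    expanded = set(footprint)
    for concept in footprint:
        expanded |= SYNONYMS.get(concept, set())
    return expanded


def expand_vertical(footprint, levels=1):
    """Return the footprint plus broader terms, up to `levels` steps up."""
    expanded = set(footprint)
    frontier = set(footprint)
    for _ in range(levels):
        # Collect broader terms of the current frontier not yet included.
        frontier = {b for c in frontier for b in BROADER.get(c, set())} - expanded
        if not frontier:
            break
        expanded |= frontier
    return expanded


def jaccard(a, b):
    """Jaccard overlap of two concept sets (assumed similarity measure)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


doc_a = {"car"}
doc_b = {"truck"}
doc_c = {"automobile"}

raw_ab = jaccard(doc_a, doc_b)
vertical_ab = jaccard(expand_vertical(doc_a), expand_vertical(doc_b))
raw_ac = jaccard(doc_a, doc_c)
horizontal_ac = jaccard(expand_horizontal(doc_a), expand_horizontal(doc_c))
```

In this sketch, doc_a and doc_b share no concept before enhancement, but both gain the broader concept "vehicle" after vertical expansion, so their overlap becomes nonzero. Likewise, doc_a and doc_c only overlap once horizontal expansion adds each other's synonym.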
Keywords :
knowledge acquisition; semantic networks; text analysis; abstraction level; broad terms; document footprint; enhancement algorithm; footprint abstract; graph-based model; horizontal expansion; knowledge extraction; knowledge structure; ontology; raw document; semantic relationship; superclasses; vertical expansion; Algorithm design and analysis; Conferences; Couplings; Databases; Educational institutions; Expert systems; Filtering; Information retrieval; Ontologies; Text analysis;
Conference_Title :
Database and Expert Systems Applications, 2003. Proceedings. 14th International Workshop on
Print_ISBN :
0-7695-1993-8
DOI :
10.1109/DEXA.2003.1232108