Title :
Ontology-Based Text Classification into Dynamically Defined Topics
Author :
Allahyari, Mehdi ; Kochut, Krys J. ; Janik, Maciej
Author_Institution :
Dept. of Comput. Sci., Univ. of Georgia, Athens, GA, USA
Abstract :
We present a method for the automatic classification of text documents into a dynamically defined set of topics of interest. The proposed approach requires only a domain ontology and a set of user-defined classification topics, specified as contexts in the ontology. Our method is based on measuring the semantic similarity of the thematic graph created from a text document and the ontology sub-graphs resulting from the projection of the defined contexts. The domain ontology effectively becomes the classifier, where classification topics are expressed using the defined ontological contexts. In contrast to the traditional supervised categorization methods, the proposed method does not require a training set of documents. More importantly, our approach allows dynamically changing the classification topics without retraining of the classifier. In our experiments, we used the English language Wikipedia converted to an RDF ontology to categorize a corpus of current Web news documents into selection of topics of interest. The high accuracy achieved in our tests demonstrates the effectiveness of the proposed method, as well as the applicability of Wikipedia for semantic text categorization purposes.
Keywords :
ontologies (artificial intelligence); pattern classification; text analysis; English language Wikipedia; RDF ontology; Web news documents; classification topics; domain ontology; dynamically defined topics; ontology context; ontology subgraphs; ontology-based text classification; resource description framework; semantic similarity; semantic text categorization; supervised categorization methods; text documents classification; thematic graph; user-defined classification topics; Context; Electronic publishing; Encyclopedias; Internet; Ontologies; Semantics; Background Knowledge; Information Retrieval; Semantic Relatedness; Text Categorization; Topic filtering;
Conference_Titel :
Semantic Computing (ICSC), 2014 IEEE International Conference on
Conference_Location :
Newport Beach, CA
Print_ISBN :
978-1-4799-4002-8
DOI :
10.1109/ICSC.2014.51