DocumentCode :
3264543
Title :
Applying RDF Ontologies to Improve Text Classification
Author :
Xiaoyue, Wang ; Rujiang, Bai
Author_Institution :
Shandong Univ. of Technol. Libr., Zibo, China
Volume :
2
fYear :
2009
fDate :
6-7 June 2009
Firstpage :
118
Lastpage :
121
Abstract :
Current classification methods are based on the ldquobag of wordsrdquo (BOW) representation, which only accounts for term frequency in the documents, and ignores important semantic relationships between key terms. In this paper, we proposed a system that uses ontologies and natural language processing techniques to index texts. Traditional BOW matrix is replaced by ldquoBag of Conceptsrdquo (BOC). For this purpose, we developed fully automated methods for mapping keywords to their corresponding ontology concepts. Support vector machine a successful machine learning technique is used for classification. Experimental results shows that our proposed method dose improve text classification performance significantly.
Keywords :
learning (artificial intelligence); natural language processing; ontologies (artificial intelligence); support vector machines; text analysis; BOW matrix; RDF ontologies; bag of words representation; machine learning technique; natural language processing techniques; support vector machine; text classification; Electronic mail; Frequency; Indexing; Libraries; Ontologies; Resource description framework; Support vector machine classification; Support vector machines; Text categorization; Vocabulary; RDF; SVM; ontology; text classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Natural Computing, 2009. CINC '09. International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-0-7695-3645-3
Type :
conf
DOI :
10.1109/CINC.2009.115
Filename :
5231026
Link To Document :
بازگشت