DocumentCode :
2625046
Title :
Improving the classification of unknown documents by concept graph
Author :
Mohaqeqi, Morteza ; Soltanpoor, Reza ; Shakery, Azadeh
Author_Institution :
ECE Dept., Univ. of Tehran, Tehran, Iran
fYear :
2009
fDate :
20-21 Oct. 2009
Firstpage :
259
Lastpage :
264
Abstract :
Concept graph is a graph that represents the relationships between language concepts. In this structure the relationship between any two words is demonstrated by a weighted edge such that the value of this weight is interpreted as the degree of the relevance of two words. Having this graph, we can obtain most relevant words to a special term. In this paper, we propose a method for improving the classification of documents from unknown sources by means of concept graph. In our method, initially some features are selected from a training set by a well-known feature selection algorithm. Then, by extracting most relevant words for each class from the concept graph, a more effective feature set is produced. Our experimental results identify an improvement of 1% and 8% in precision and recall measures, respectively.
Keywords :
feature extraction; graph theory; information retrieval; pattern classification; concept graph; feature extraction; feature selection algorithm; unknown document classification; Artificial intelligence; Classification algorithms; Content based retrieval; Data mining; Indexing; Information retrieval; Knowledge representation; Ontologies; Testing; Text categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Conference, 2009. CSICC 2009. 14th International CSI
Conference_Location :
Tehran
Print_ISBN :
978-1-4244-4261-4
Electronic_ISBN :
978-1-4244-4262-1
Type :
conf
DOI :
10.1109/CSICC.2009.5349402
Filename :
5349402
Link To Document :
بازگشت