DocumentCode
1966059
Title
Feature selection for collective classification
Author
Senliol, Baris ; Aral, Atakan ; Cataltepe, Zehra
Author_Institution
Comput. Eng. Dept., Istanbul Tech. Univ., Istanbul, Turkey
fYear
2009
fDate
14-16 Sept. 2009
Firstpage
286
Lastpage
291
Abstract
When in addition to node contents and labels, relations (links) between nodes and some unlabeled nodes are available, collective classification algorithms can be used. Collective classification algorithms, like ICA (iterative classification algorithm), determine labels for the unlabeled nodes based on the contents and/or labels of the neighboring nodes. Feature selection algorithms have been shown to improve classification accuracy for traditional machine learning algorithms. In this paper, we use a recent and successful feature selection algorithm, mRMR (minimum redundancy maximum relevance, Ding and Peng, 2003), on content features. On two scientific paper citation data sets, Cora and Citeseer, when only content information is used, we know that the selected features may result in almost as good performance as all the features. When feature selection is performed both on content and link information, even better classification accuracies are obtained. Feature selection considerably reduces the training time for both content only and ICA algorithms.
Keywords
iterative methods; learning (artificial intelligence); pattern classification; collective classification algorithms; feature selection algorithms; iterative classification algorithm; machine learning algorithms; minimum redundancy maximum relevance; Chemicals; Classification algorithms; Computer networks; Independent component analysis; Inference algorithms; Iterative algorithms; Logistics; Machine learning algorithms; Pattern recognition; Testing; Citeseer; Collective Classification; Cora; Feature Selection; Iterative Classification Algorithm (ICA); Logistic Regression; Minimum Redundancy Maximum Relevance (mRMR);
fLanguage
English
Publisher
ieee
Conference_Titel
Computer and Information Sciences, 2009. ISCIS 2009. 24th International Symposium on
Conference_Location
Guzelyurt
Print_ISBN
978-1-4244-5021-3
Electronic_ISBN
978-1-4244-5023-7
Type
conf
DOI
10.1109/ISCIS.2009.5291828
Filename
5291828
Link To Document