DocumentCode
1303838
Title
Semisupervised Classification With Cluster Regularization
Author
Soares, R.G.F. ; Huanhuan Chen ; Xin Yao
Author_Institution
Centre of Excellence for Res. in Comput. Intell. & Applic., Univ. of Birmingham, Birmingham, UK
Volume
23
Issue
11
fYear
2012
Firstpage
1779
Lastpage
1792
Abstract
Semisupervised classification (SSC) learns, from cheap unlabeled data and labeled data, to predict the labels of test instances. In order to make use of the information from unlabeled data, there should be an assumed relationship between the true class structure and the data distribution. One assumption is that data points clustered together are likely to have the same class label. In this paper, we propose a new algorithm, namely, cluster-based regularization (ClusterReg) for SSC, that takes the partition given by a clustering algorithm as a regularization term in the loss function of an SSC classifier. ClusterReg makes predictions according to the cluster structure together with limited labeled data. The experiments confirmed that ClusterReg has a good generalization ability for real-world problems. Its performance is excellent when data follows this cluster assumption. Even when these clusters have misleading overlaps, it still outperforms other state-of-the-art algorithms.
Keywords
learning (artificial intelligence); pattern classification; pattern clustering; ClusterReg; SSC classifier; class structure; cluster-based regularization; data distribution; data points clustering; loss function; regularization term; semisupervised classification; test instance label prediction; unlabeled data learning; Algorithm design and analysis; Clustering algorithms; Manifolds; Partitioning algorithms; Prediction algorithms; Robustness; Training; Clustering; machine learning; regularization; semisupervised learning;
fLanguage
English
Journal_Title
Neural Networks and Learning Systems, IEEE Transactions on
Publisher
ieee
ISSN
2162-237X
Type
jour
DOI
10.1109/TNNLS.2012.2214488
Filename
6317193
Link To Document