مرکز منطقه ای اطلاع رساني علوم و فناوري - Scaling Up Graph-Based Semisupervised Learning via Prototype Vector Machines

DocumentCode :

88102

Title :

Scaling Up Graph-Based Semisupervised Learning via Prototype Vector Machines

Author :

Kai Zhang ; Liang Lan ; Kwok, James T. ; Vucetic, Slobodan ; Parvin, Bahram

Author_Institution :

NEC Labs. America, Inc., Princeton, NJ, USA

Volume :

Issue :

fYear :

2015

fDate :

Mar-15

Firstpage :

444

Lastpage :

457

Abstract :

When the amount of labeled data are limited, semisupervised learning can improve the learner´s performance by also using the often easily available unlabeled data. In particular, a popular approach requires the learned function to be smooth on the underlying data manifold. By approximating this manifold as a weighted graph, such graph-based techniques can often achieve state-of-the-art performance. However, their high time and space complexities make them less attractive on large data sets. In this paper, we propose to scale up graph-based semisupervised learning using a set of sparse prototypes derived from the data. These prototypes serve as a small set of data representatives, which can be used to approximate the graph-based regularizer and to control model complexity. Consequently, both training and testing become much more efficient. Moreover, when the Gaussian kernel is used to define the graph affinity, a simple and principled method to select the prototypes can be obtained. Experiments on a number of real-world data sets demonstrate encouraging performance and scaling properties of the proposed approach. It also compares favorably with models learned via ℓ₁-regularization at the same level of model sparsity. These results demonstrate the efficacy of the proposed approach in producing highly parsimonious and accurate models for semisupervised learning.

Keywords :

Gaussian processes; computational complexity; data analysis; graph theory; learning (artificial intelligence); Gaussian kernel; graph affinity; graph-based regularizer; graph-based semisupervised learning; prototype vector machines; space complexity; time complexity; weighted graph; Approximation methods; Kernel; Laplace equations; Manifolds; Prototypes; Semisupervised learning; Training; Graph-based methods; large data sets; low-rank approximation; manifold regularization; semisupervised learning; semisupervised learning.;

fLanguage :

English

Journal_Title :

Neural Networks and Learning Systems, IEEE Transactions on

Publisher :

ieee

ISSN :

2162-237X

Type :

jour

DOI :

10.1109/TNNLS.2014.2315526

Filename :

6803073

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=88102