Title :
Constrained minimum cut for classification using labeled and unlabeled data
Author_Institution :
Dept. of Comput. Sci., Hong Kong Baptist Univ., China
Abstract :
The use of unlabeled data has lead to an improvement in classification accuracy for a variety of classification problems via co-training approaches. In the co-training approach, the data has to be available in a dual view representation or two distinct classifiers are required. In this paper, a unified energy equation for classification combining labeled data and unlabeled data is introduced. This classification formulation is posed as a constrained minimum cut problem integrating labeling information on labeled data and cluster similarity information on unlabeled data for joint estimation. A novel constrained randomized contraction algorithm is proposed for finding the solution to the constrained minimum cuts problem. Experimental results on standard datasets and synthetic datasets are presented.
Keywords :
pattern classification; pattern clustering; randomised algorithms; unsupervised learning; classification; cluster similarity information; co-training approaches; constrained minimum cut; constrained randomized contraction algorithm; datasets; joint estimation; labeled data; unified energy equation; unlabeled data; Clustering algorithms; Computer science; Equations; Inference algorithms; Labeling; Machine learning; Machine learning algorithms; Support vector machine classification; Support vector machines; Web pages;
Conference_Titel :
Computer Vision and Pattern Recognition, 2001. CVPR 2001. Proceedings of the 2001 IEEE Computer Society Conference on
Print_ISBN :
0-7695-1272-0
DOI :
10.1109/CVPR.2001.991017