Title :
Building text classifiers using positive and unlabeled examples
Author :
Liu, Bing ; Dai, Yang ; Li, Xiaoli ; Lee, Wee Sun ; Yu, Philip S.
Author_Institution :
Dept. of Comput. Sci., Univ. of Illinois at Chicago, IL, USA
Abstract :
We study the problem of building text classifiers using positive and unlabeled examples. The key feature of this problem is that there is no negative example for learning. Recently, a few techniques for solving this problem were proposed in the literature. These techniques are based on the same idea, which builds a classifier in two steps. Each existing technique uses a different method for each step. We first introduce some new methods for the two steps, and perform a comprehensive evaluation of all possible combinations of methods of the two steps. We then propose a more principled approach to solving the problem based on a biased formulation of SVM, and show experimentally that it is more accurate than the existing techniques.
Keywords :
Bayes methods; belief networks; pattern classification; support vector machines; text analysis; SVM; positive example; text classifier; unlabeled example; Biomedical engineering; Computer science; Iterative algorithms; Labeling; Niobium; Performance evaluation; Sun; Support vector machine classification; Support vector machines; Text categorization;
Conference_Titel :
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN :
0-7695-1978-4
DOI :
10.1109/ICDM.2003.1250918