DocumentCode
2369421
Title
Building text classifiers using positive and unlabeled examples
Author
Liu, Bing ; Dai, Yang ; Li, Xiaoli ; Lee, Wee Sun ; Yu, Philip S.
Author_Institution
Dept. of Comput. Sci., Univ. of Illinois at Chicago, IL, USA
fYear
2003
fDate
19-22 Nov. 2003
Firstpage
179
Lastpage
186
Abstract
We study the problem of building text classifiers using positive and unlabeled examples. The key feature of this problem is that there is no negative example for learning. Recently, a few techniques for solving this problem were proposed in the literature. These techniques are based on the same idea, which builds a classifier in two steps. Each existing technique uses a different method for each step. We first introduce some new methods for the two steps, and perform a comprehensive evaluation of all possible combinations of methods of the two steps. We then propose a more principled approach to solving the problem based on a biased formulation of SVM, and show experimentally that it is more accurate than the existing techniques.
Keywords
Bayes methods; belief networks; pattern classification; support vector machines; text analysis; SVM; positive example; text classifier; unlabeled example; Biomedical engineering; Computer science; Iterative algorithms; Labeling; Niobium; Performance evaluation; Sun; Support vector machine classification; Support vector machines; Text categorization;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN
0-7695-1978-4
Type
conf
DOI
10.1109/ICDM.2003.1250918
Filename
1250918
Link To Document