• DocumentCode
    2369421
  • Title

    Building text classifiers using positive and unlabeled examples

  • Author

    Liu, Bing ; Dai, Yang ; Li, Xiaoli ; Lee, Wee Sun ; Yu, Philip S.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Illinois at Chicago, IL, USA
  • fYear
    2003
  • fDate
    19-22 Nov. 2003
  • Firstpage
    179
  • Lastpage
    186
  • Abstract
    We study the problem of building text classifiers using positive and unlabeled examples. The key feature of this problem is that there is no negative example for learning. Recently, a few techniques for solving this problem were proposed in the literature. These techniques are based on the same idea, which builds a classifier in two steps. Each existing technique uses a different method for each step. We first introduce some new methods for the two steps, and perform a comprehensive evaluation of all possible combinations of methods of the two steps. We then propose a more principled approach to solving the problem based on a biased formulation of SVM, and show experimentally that it is more accurate than the existing techniques.
  • Keywords
    Bayes methods; belief networks; pattern classification; support vector machines; text analysis; SVM; positive example; text classifier; unlabeled example; Biomedical engineering; Computer science; Iterative algorithms; Labeling; Niobium; Performance evaluation; Sun; Support vector machine classification; Support vector machines; Text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
  • Print_ISBN
    0-7695-1978-4
  • Type

    conf

  • DOI
    10.1109/ICDM.2003.1250918
  • Filename
    1250918