• DocumentCode
    482176
  • Title

    An Integration of CoTraining and Affinity Propagation for PU Text Classification

  • Author

    Luo, Na ; Yuan, Fuyu ; Zuo, Wanli

  • Author_Institution
    Coll. of Comput. & Sci. & Technol., JiLin Univ., Changchun
  • Volume
    1
  • fYear
    2009
  • fDate
    22-24 Jan. 2009
  • Firstpage
    150
  • Lastpage
    154
  • Abstract
    Under the framework of PU(Positive data and Unlabeled data), this paper originally proposes a three-setp algorithm. First, CoTraining is employed for filtering out the likely positive data from the unlabeled dataset U. Second, affinity propagation (AP) approach attempts to pick out the strong positive from likely positive set which is produced in first step. Those data picked out can be supplied to positive dataset P. Finally, a linear One-Class SVM will learn from both the purified U as negative and the expanded P as positive. Because of the algorithm´s characteristic of automatic expanding positive dataset, the proposed algorithm especially performs well in situations where given positive dataset P is insufficient. A comprehensive experiment had proved that our algorithm is preferable to the existing ones.
  • Keywords
    support vector machines; text analysis; text editing; CoTraining; affinity propagation; linear one-class SVM; positive dataset; support vector machine; text classification; three-setp algorithm; unlabeled data; Chemical technology; Chemistry; Data engineering; Educational institutions; Employment; Filtering algorithms; Laboratories; Supervised learning; Support vector machines; Text categorization; Affinity Propagation; CoTraining; PU Classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Engineering and Technology, 2009. ICCET '09. International Conference on
  • Conference_Location
    Singapore
  • Print_ISBN
    978-1-4244-3334-6
  • Type

    conf

  • DOI
    10.1109/ICCET.2009.131
  • Filename
    4769445