• DocumentCode
    501766
  • Title

    Using Multi-phase Cost-Sensitive Learning to Filtering Spam

  • Author

    Li, Wenbin ; Cheng, Yiying ; Liu, TaiFeng ; Zhang, Xindong ; Zhong, Ning

  • Author_Institution
    Sch. of Inf. Eng., Shijiazhuang Univ. of Econ., Shijiazhuang, China
  • Volume
    2
  • fYear
    2009
  • fDate
    12-14 Aug. 2009
  • Firstpage
    39
  • Lastpage
    44
  • Abstract
    This paper proposes a novel ensemble learning framework, namely, multiple-phase cost-sensitive ensemble learning (MPCSL) which simulates the means and process of human being learning.It consists of two types learning, i.e., direct learning which learns multiple weak learners from a training dataset via some homogeneous or heterogenous algorithms, and indirect learning that constructs a committee from the knowledge of the combined filters or other committees. This paper studies empirically the performance of MPCSL on spam filtering tasks.In the occasions of combining homogeneous and heterogeneous,how the performance of MPCSL changes is surveyed. The results shows that MPCSL is a compellent ensemble learning method for cost-sensitive tasks such as spam filtering.
  • Keywords
    information filtering; learning (artificial intelligence); unsolicited e-mail; ensemble learning framework; multiphase cost-sensitive learning; spam filtering; training dataset; Bagging; Boosting; Hybrid intelligent systems; Information filtering; Information filters; Learning systems; Paper technology; Text categorization; Unsolicited electronic mail; Voting; cost sensitive learning; machine learning; spam filtering; text processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Hybrid Intelligent Systems, 2009. HIS '09. Ninth International Conference on
  • Conference_Location
    Shenyang
  • Print_ISBN
    978-0-7695-3745-0
  • Type

    conf

  • DOI
    10.1109/HIS.2009.120
  • Filename
    5254416