DocumentCode
501766
Title
Using Multi-phase Cost-Sensitive Learning to Filtering Spam
Author
Li, Wenbin ; Cheng, Yiying ; Liu, TaiFeng ; Zhang, Xindong ; Zhong, Ning
Author_Institution
Sch. of Inf. Eng., Shijiazhuang Univ. of Econ., Shijiazhuang, China
Volume
2
fYear
2009
fDate
12-14 Aug. 2009
Firstpage
39
Lastpage
44
Abstract
This paper proposes a novel ensemble learning framework, namely, multiple-phase cost-sensitive ensemble learning (MPCSL) which simulates the means and process of human being learning.It consists of two types learning, i.e., direct learning which learns multiple weak learners from a training dataset via some homogeneous or heterogenous algorithms, and indirect learning that constructs a committee from the knowledge of the combined filters or other committees. This paper studies empirically the performance of MPCSL on spam filtering tasks.In the occasions of combining homogeneous and heterogeneous,how the performance of MPCSL changes is surveyed. The results shows that MPCSL is a compellent ensemble learning method for cost-sensitive tasks such as spam filtering.
Keywords
information filtering; learning (artificial intelligence); unsolicited e-mail; ensemble learning framework; multiphase cost-sensitive learning; spam filtering; training dataset; Bagging; Boosting; Hybrid intelligent systems; Information filtering; Information filters; Learning systems; Paper technology; Text categorization; Unsolicited electronic mail; Voting; cost sensitive learning; machine learning; spam filtering; text processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Hybrid Intelligent Systems, 2009. HIS '09. Ninth International Conference on
Conference_Location
Shenyang
Print_ISBN
978-0-7695-3745-0
Type
conf
DOI
10.1109/HIS.2009.120
Filename
5254416
Link To Document