DocumentCode
2283510
Title
Chinese spam filtering based on online active learning methods
Author
Sun, Guanglu ; Ma, Yingcai ; Shen, Yuewu ; Guo, Feng
Author_Institution
School of Computing Science and Technology, Harbin University of Science and Technology, Harbin, China
fYear
2012
fDate
18-21 Sept. 2012
Firstpage
1
Lastpage
5
Abstract
In this paper, new active learning methods are proposed to filter Chinese spam. It is time-consuming and expensive to label the spam emails in the large datasets. Active learning methods can conspicuously reduce labeling cost by identifying informative examples and speed up online Logistic Regression filter. The experiments illustrate that our methods not only decrease the number of label requests, but also improve the classification performance of spam filtering.
Keywords
information filtering; learning (artificial intelligence); regression analysis; unsolicited e-mail; Chinese spam filtering; large datasets; logistic regression filter; online active learning methods; spam emails; Educational institutions; Electronic mail; Filtering; Learning systems; Logistics; Machine learning; Training; Active learning; Chinese spam filtering; Logistic Regression;
fLanguage
English
Publisher
ieee
Conference_Titel
Strategic Technology (IFOST), 2012 7th International Forum on
Conference_Location
Tomsk
Print_ISBN
978-1-4673-1772-6
Type
conf
DOI
10.1109/IFOST.2012.6357637
Filename
6357637
Link To Document