DocumentCode
2263163
Title
Spam Detection Using Dynamic Weighted Voting Based on Clustering
Author
Saeedian, Mehrnoush Famil ; Beigy, Hamid
Author_Institution
Dept. of Comput. Eng., Sharif Univ. of Technol., Tehran
Volume
2
fYear
2008
fDate
20-22 Dec. 2008
Firstpage
122
Lastpage
126
Abstract
In the last decade spam detection has been addressed as a text classification or categorization problem. In this paper we propose a new dynamic weighted voting method based on the combination of clustering and weighted voting, and apply it to the task of spam filtering. In order to classify a new sample, it first compares with all cluster centroids and its similarity to each cluster is identified; Classifiers in the vicinity of the input sample obtain greater weight for the final decision of the ensemble. The evaluation shows that the algorithm outperforms pure SVM.
Keywords
e-mail filters; pattern classification; pattern clustering; security of data; unsolicited e-mail; clustering; dynamic weighted voting method; spam detection; spam filtering; text categorization problem; text classification problem; Filtering; Filters; Machine learning algorithms; Nearest neighbor searches; Niobium; Support vector machine classification; Support vector machines; Training data; Unsolicited electronic mail; Voting; classification; classifier fusion; clustering; ensemble; spam;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Information Technology Application, 2008. IITA '08. Second International Symposium on
Conference_Location
Shanghai
Print_ISBN
978-0-7695-3497-8
Type
conf
DOI
10.1109/IITA.2008.140
Filename
4739740
Link To Document