DocumentCode :
1592276
Title :
Adaptive filtering of spam
Author :
Pelletier, L. ; Almhana, J. ; Choulakian, V.
Author_Institution :
GRETI, Moncton Univ., NB, Canada
fYear :
2004
Firstpage :
218
Lastpage :
224
Abstract :
We present a new spam filter which acts as an additional layer in the spam filtering process. This filter is based on what we call a representative vocabulary. Spam e-mails are divided into categories in which each category is represented by a set of tokens which form a representative text (RT). Tokens are strings of characters (words, sentences, or sometimes meaningless strings of characters). This RT is used to compute a resemblance ratio with incoming e-mails. With this ratio, we decide whether the incoming e-mail is a spam. This filter was implemented and integrated to Spamihilator software. Some experimental and interesting results are presented.
Keywords :
text analysis; unsolicited e-mail; vocabulary; Spamihilator software; adaptive filtering; adaptive spam filtering; character strings; representative text; representative vocabulary; resemblance ratio; unsolicited e-mail; Adaptive filters; Bandwidth; Bayesian methods; Costs; Electronic mail; Information filtering; Information filters; Unsolicited electronic mail; Vocabulary; Web and internet services;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communication Networks and Services Research, 2004. Proceedings. Second Annual Conference on
Print_ISBN :
0-7695-2096-0
Type :
conf
DOI :
10.1109/DNSR.2004.1344731
Filename :
1344731
Link To Document :
بازگشت