DocumentCode
2786572
Title
Bayesian statistical analysis for spams
Author
Begriche, Youcef ; Serhrouchni, Ahmed
Author_Institution
Inst. Telecom, Telecom ParisTech, Paris, France
fYear
2010
fDate
10-14 Oct. 2010
Firstpage
989
Lastpage
992
Abstract
This paper presents a Bayesian statistical analysis applied to the spam problem. In most anti-spam related research, generally it is assumed that the probability of a spam occurrence is equal to 0.5, which is in our opinion unrealistic. It is also assumed that in the spam message, words are considered as an independent family of words. This makes us look at how the posterior probability behaves when the a priori probability is different from 0.5 and derive the consequences of the assumption of independent words on the posterior probability. The first assumption pushes us to define a prior and find a posterior probability laws to enhance the spam detection and increase the reliability decision. This analysis differs from previous results, that used the Bayesian approach to the anti-spam issue, especially through refinement and enhancement of various probability laws.
Keywords
Bayes methods; probability; security of data; unsolicited e-mail; Bayesian statistical analysis; a priori probability; antispam; posterior probability; probability law; reliability decision; spam detection; spam message; spam occurrence; Bayesian methods; Filtering; Niobium; Telecommunications; Training; Unsolicited electronic mail; Bayesian statistical model; Binomial law; Classification; Conditional density; Distribution attachment; Ham(H); Spam(S);
fLanguage
English
Publisher
ieee
Conference_Titel
Local Computer Networks (LCN), 2010 IEEE 35th Conference on
Conference_Location
Denver, CO
ISSN
0742-1303
Print_ISBN
978-1-4244-8387-7
Type
conf
DOI
10.1109/LCN.2010.5735846
Filename
5735846
Link To Document