• DocumentCode
    2786572
  • Title

    Bayesian statistical analysis for spams

  • Author

    Begriche, Youcef ; Serhrouchni, Ahmed

  • Author_Institution
    Inst. Telecom, Telecom ParisTech, Paris, France
  • fYear
    2010
  • fDate
    10-14 Oct. 2010
  • Firstpage
    989
  • Lastpage
    992
  • Abstract
    This paper presents a Bayesian statistical analysis applied to the spam problem. In most anti-spam related research, generally it is assumed that the probability of a spam occurrence is equal to 0.5, which is in our opinion unrealistic. It is also assumed that in the spam message, words are considered as an independent family of words. This makes us look at how the posterior probability behaves when the a priori probability is different from 0.5 and derive the consequences of the assumption of independent words on the posterior probability. The first assumption pushes us to define a prior and find a posterior probability laws to enhance the spam detection and increase the reliability decision. This analysis differs from previous results, that used the Bayesian approach to the anti-spam issue, especially through refinement and enhancement of various probability laws.
  • Keywords
    Bayes methods; probability; security of data; unsolicited e-mail; Bayesian statistical analysis; a priori probability; antispam; posterior probability; probability law; reliability decision; spam detection; spam message; spam occurrence; Bayesian methods; Filtering; Niobium; Telecommunications; Training; Unsolicited electronic mail; Bayesian statistical model; Binomial law; Classification; Conditional density; Distribution attachment; Ham(H); Spam(S);
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Local Computer Networks (LCN), 2010 IEEE 35th Conference on
  • Conference_Location
    Denver, CO
  • ISSN
    0742-1303
  • Print_ISBN
    978-1-4244-8387-7
  • Type

    conf

  • DOI
    10.1109/LCN.2010.5735846
  • Filename
    5735846