• DocumentCode
    527346
  • Title

    The application of decision tree in Chinese email classification

  • Author

    Chen, Hao ; Zhan, Yan ; Li, Yan

  • Author_Institution
    Key Lab. of Machine Learning & Comput. Intell., Hebei Univ., Baoding, China
  • Volume
    1
  • fYear
    2010
  • fDate
    11-14 July 2010
  • Firstpage
    305
  • Lastpage
    308
  • Abstract
    Email is a kind of semi-structured document, some important attributes are contained in its structure, and especially using spam-specific features could improve the email classification results. In this paper, we apply decision tree data mining technique to dig out the potential association rules among these attributes of email, and then to identify unknown email´s category based on these rules. According to the experiment of applying numerous Chinese emails to our email classifier, the efficiency of our method is not lower than that of other existing methods of checking whole email content text. Meanwhile our method can reduce the cost of computation and consumption of system resources.
  • Keywords
    classification; data mining; decision trees; electronic mail; natural language processing; Chinese email classification; association rules; data mining; decision tree; semi-structured document; Association rules; Classification algorithms; Classification tree analysis; Electronic mail; Machine learning; Postal services; Association rule mining; Decision tree; Email classification; Spam-specific feature;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics (ICMLC), 2010 International Conference on
  • Conference_Location
    Qingdao
  • Print_ISBN
    978-1-4244-6526-2
  • Type

    conf

  • DOI
    10.1109/ICMLC.2010.5581046
  • Filename
    5581046