DocumentCode :
527346
Title :
The application of decision tree in Chinese email classification
Author :
Chen, Hao ; Zhan, Yan ; Li, Yan
Author_Institution :
Key Lab. of Machine Learning & Comput. Intell., Hebei Univ., Baoding, China
Volume :
1
fYear :
2010
fDate :
11-14 July 2010
Firstpage :
305
Lastpage :
308
Abstract :
Email is a kind of semi-structured document, some important attributes are contained in its structure, and especially using spam-specific features could improve the email classification results. In this paper, we apply decision tree data mining technique to dig out the potential association rules among these attributes of email, and then to identify unknown email´s category based on these rules. According to the experiment of applying numerous Chinese emails to our email classifier, the efficiency of our method is not lower than that of other existing methods of checking whole email content text. Meanwhile our method can reduce the cost of computation and consumption of system resources.
Keywords :
classification; data mining; decision trees; electronic mail; natural language processing; Chinese email classification; association rules; data mining; decision tree; semi-structured document; Association rules; Classification algorithms; Classification tree analysis; Electronic mail; Machine learning; Postal services; Association rule mining; Decision tree; Email classification; Spam-specific feature;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2010 International Conference on
Conference_Location :
Qingdao
Print_ISBN :
978-1-4244-6526-2
Type :
conf
DOI :
10.1109/ICMLC.2010.5581046
Filename :
5581046
Link To Document :
بازگشت