DocumentCode
2988127
Title
Research on illegal E-mails recognition based on VSM and Statistical Decision Tree
Author
Wang, Ke-jian ; Han, Xian-zhong ; Guo, Tao
Author_Institution
Sch. of Inf. Sci. & Technol., Agric. Univ. of Hebei, Baoding
Volume
2
fYear
2008
fDate
30-31 Aug. 2008
Firstpage
480
Lastpage
484
Abstract
This paper introduces an algorithm based on VSM algorithm and statistical decision tree (SDT) to recognize illegal e-mails. The vector space model is simple and easy to operate. At first, the vector space model (VSM ) can filter some specific words which are often used in illegal e-mails. Then, SDT can judge illegal e-mails by Semanteme analyze. After the two steps, the illegal e-mails can also be easily identified and the recognition rate of illegal E-mails has been improved by basic experiments. Theoretical analysis and basic experiments shows that the illegal emails can be recognized effectively with VSM and SDT algorithm.
Keywords
decision trees; information filtering; statistical analysis; unsolicited e-mail; illegal e-mail recognition; information filtering; semanteme analyze; statistical decision tree; vector space model; Algorithm design and analysis; Decision trees; Electronic mail; Filters; Internet; Pattern recognition; Postal services; Space technology; Unsolicited electronic mail; Wavelet analysis; Illegal E-mails; Semanteme analyze; Statistical Decision Tree; Vector Space Model;
fLanguage
English
Publisher
ieee
Conference_Titel
Wavelet Analysis and Pattern Recognition, 2008. ICWAPR '08. International Conference on
Conference_Location
Hong Kong
Print_ISBN
978-1-4244-2238-8
Electronic_ISBN
978-1-4244-2239-5
Type
conf
DOI
10.1109/ICWAPR.2008.4635828
Filename
4635828
Link To Document