DocumentCode :
169511
Title :
An optimized k-NN classifier based on minimum spanning tree for email filtering
Author :
Chakrabarty, Ankush ; Roy, Sandip
Author_Institution :
Dept. of Comput. Applic., Future Inst. of Eng. & Manage., Kolkata, India
fYear :
2014
fDate :
9-11 Jan. 2014
Firstpage :
47
Lastpage :
52
Abstract :
In the era of internet where mailboxes are being flooded by unnecessary emails, it becomes troublesome and time consuming to organize and classify legitimate emails into folders. Although there has been extensive investigation of automatic document categorization, email classification gives rise to a number of challenges, and there has been relatively little study in this domain. This paper presents a framework for email classification using Enron email dataset based on an improved k-NN classification using a minimum spanning tree clustering algorithm considering the case where the number of clusters (email folders) are unknown initially. Such a classification can be useful in maintaining email and web directories, identifying spam and valid mails. Experimental results show that the proposed algorithm outperforms state of art classification algorithms like standard k-NN and Naïve Bayes classifiers and c4.5 decision tree classifier.
Keywords :
Bayes methods; Internet; decision trees; document handling; electronic mail; information filtering; pattern classification; pattern clustering; Enron email dataset; Internet; Web directories; automatic document categorization; c4.5 decision tree classifier; email filtering; email folders; k-NN classifier optimization; legitimate email classification; mailboxes; minimum spanning tree clustering algorithm; naive Bayes classifiers; Accuracy; Artificial neural networks; Decision trees; Training; Enron data set; Jaccard coefficient; Minimum Spanning tree; kNN classification; spam filtering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Business and Information Management (ICBIM), 2014 2nd International Conference on
Conference_Location :
Durgapur
Print_ISBN :
978-1-4799-3263-4
Type :
conf
DOI :
10.1109/ICBIM.2014.6970931
Filename :
6970931
Link To Document :
بازگشت