Title :
A Binarization Approach to Email Categorization using Binary Decision Tree
Author :
Xia, Yun-qing ; Wang, Jian-Xin ; Zheng, Fang ; Liu, Yi
Author_Institution :
Tsinghua Univ., Beijing
Abstract :
Binarization approaches are found promising in performing the task of email categorization. Amongst the standard binarization approaches, the one combining the some-against-rest binarization method and round robin assembling method is discovered most effective. However, two drawbacks are worth noting, i.e., effectiveness of the some-against-rest binarization method and computational complexity in training. This paper presents an algorithm in finding the binary decision tree, i.e. the optimal some-against-rest binarization solution. In classification stage, the binary decision tree is combined with the elimination method to address the two problems. Experimental results show that the binary decision tree is more effective in email categorization and computationally less complex in training.
Keywords :
binary decision diagrams; decision trees; electronic mail; information filtering; text analysis; binarization approach; binary decision tree; computational complexity; elimination method; email categorization; email classification; round robin assembling method; some-against-rest binarization method; Assembly; Classification tree analysis; Computational complexity; Cybernetics; Decision trees; Machine learning; Partitioning algorithms; Round robin; Speech; Testing; Binary decision tree; Email categorization; Some-against-rest binarizatoin;
Conference_Titel :
Machine Learning and Cybernetics, 2007 International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-0973-0
Electronic_ISBN :
978-1-4244-0973-0
DOI :
10.1109/ICMLC.2007.4370746