• DocumentCode
    1936763
  • Title

    A Binarization Approach to Email Categorization using Binary Decision Tree

  • Author

    Xia, Yun-qing ; Wang, Jian-Xin ; Zheng, Fang ; Liu, Yi

  • Author_Institution
    Tsinghua Univ., Beijing
  • Volume
    6
  • fYear
    2007
  • fDate
    19-22 Aug. 2007
  • Firstpage
    3459
  • Lastpage
    3464
  • Abstract
    Binarization approaches are found promising in performing the task of email categorization. Amongst the standard binarization approaches, the one combining the some-against-rest binarization method and round robin assembling method is discovered most effective. However, two drawbacks are worth noting, i.e., effectiveness of the some-against-rest binarization method and computational complexity in training. This paper presents an algorithm in finding the binary decision tree, i.e. the optimal some-against-rest binarization solution. In classification stage, the binary decision tree is combined with the elimination method to address the two problems. Experimental results show that the binary decision tree is more effective in email categorization and computationally less complex in training.
  • Keywords
    binary decision diagrams; decision trees; electronic mail; information filtering; text analysis; binarization approach; binary decision tree; computational complexity; elimination method; email categorization; email classification; round robin assembling method; some-against-rest binarization method; Assembly; Classification tree analysis; Computational complexity; Cybernetics; Decision trees; Machine learning; Partitioning algorithms; Round robin; Speech; Testing; Binary decision tree; Email categorization; Some-against-rest binarizatoin;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2007 International Conference on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    978-1-4244-0973-0
  • Electronic_ISBN
    978-1-4244-0973-0
  • Type

    conf

  • DOI
    10.1109/ICMLC.2007.4370746
  • Filename
    4370746