Title :
Enhancing the performance of decision tree: A research study of dealing with unbalanced data
Author :
Almas, Amera ; Farquad, M. A H ; Avala, N. S Ranganath ; Sultana, Jabeen
Author_Institution :
Coll. of Comput. & Inf. Technol., Taif Univ., Taif, Saudi Arabia
Abstract :
Computational intelligence techniques are proved to be outperforming compared to standard statistical techniques, specifically when dealing with large, unbalanced and high dimensional data. In this paper we present an enhancement approach for improving the performance of decision tree using Support Vector Machine (SVM) when dealing with unbalanced data. The proposed approach modifies the available training data according to the predictions of SVM and this modified training data is then used to train decision tree. As the dataset at hand i.e. COIL data is highly unbalanced with 94:6 class distribution ratios, we also employed various standard sampling techniques for extensive analysis. Based on sensitivity measure, it is observed that the proposed approach enhanced the efficiency of decision tree exceptionally well. Other intelligent methods can be tested in place of DT.
Keywords :
data analysis; decision trees; pattern classification; sampling methods; support vector machines; COIL data; SVM prediction; class distribution ratio; computational intelligence technique; decision tree performance enhancement; extensive analysis; high dimensional data; sampling technique; sensitivity measure; statistical technique; support vector machine; unbalanced data; Accuracy; Decision trees; Machine learning; Sensitivity; Support vector machines; Training; Training data; CoIL Dataset; Decision Tree; Support Vector Machine; Unbalanced Data;
Conference_Titel :
Digital Information Management (ICDIM), 2012 Seventh International Conference on
Conference_Location :
Macau
Print_ISBN :
978-1-4673-2428-1
DOI :
10.1109/ICDIM.2012.6360115