DocumentCode
3277576
Title
Improved C4.5 decision tree algorithm based on sample selection
Author
Fucai Chen ; Xiaowei Li ; Lixiong Liu
Author_Institution
China Nat. Digital Switching Syst. Eng. Technol. R&D Center, Zhengzhou, China
fYear
2013
fDate
23-25 May 2013
Firstpage
779
Lastpage
782
Abstract
To improve the classification accuracy and reduce the training time of large sample, and find the best training set, this paper proposes the improved C4.5 decision tree algorithm based on sample selection. The algorithm is based on the fact that decision tree can only get local optimal solution and has the bigger relativity with initial sample. In sample selection, we use iteration process to find the best training set. Using accuracy of the selected sample training as iteration Information is highly optimized for general use. Partition similarity is used for the best selection as the standard. Experiments show that the accuracy and time consumption of the proposed algorithm, aiming at classification and identification of large sample data, is better than C4.5 decision tree.
Keywords
decision trees; pattern classification; classification accuracy improvement; improved C4.5 decision tree algorithm; iteration Information; iteration process; large sample data classification; large sample data identification; local optimal solution; partition similarity; sample selection; training time reduction; IP networks; Training; C4.5 decision tree; partition similarity; sample selection;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Engineering and Service Science (ICSESS), 2013 4th IEEE International Conference on
Conference_Location
Beijing
ISSN
2327-0586
Print_ISBN
978-1-4673-4997-0
Type
conf
DOI
10.1109/ICSESS.2013.6615421
Filename
6615421
Link To Document