DocumentCode
3699296
Title
Decision tree algorithm optimization research based on MapReduce
Author
Fangfang Yuan;Fusheng Lian;Xingjian Xu;Zhaohua Ji
Author_Institution
College of Computer and Information Engineering, Inner Mongolia Normal University, Hohhot 010010, China
fYear
2015
Firstpage
1010
Lastpage
1013
Abstract
With the advent of the computer science, the data volume that needed to be processed under many practical situations increases dramatically, challenging many traditional machine learning techniques. Bearing this in mind, we made an intensive study on the optimization of decision tree algorithm and its corresponding porting to the big data analysis in this paper. An optimized genetic algorithm is merged into the implementation of the decision tree algorithm above, and we also invent a parallel genetic decision tree algorithm using MapReduce, which is very suitable for analyzing big data in cloud computing environment. Experiment results show that our algorithm acquires a nearly linear speedup, keeping a similar classification accuracy at the same time.
Keywords
"Decision trees","Algorithm design and analysis","Classification algorithms","Optimization","Genetic algorithms","Genetics","Cloud computing"
Publisher
ieee
Conference_Titel
Software Engineering and Service Science (ICSESS), 2015 6th IEEE International Conference on
ISSN
2327-0586
Print_ISBN
978-1-4799-8352-0
Electronic_ISBN
2327-0594
Type
conf
DOI
10.1109/ICSESS.2015.7339225
Filename
7339225
Link To Document