• DocumentCode
    3699296
  • Title

    Decision tree algorithm optimization research based on MapReduce

  • Author

    Fangfang Yuan;Fusheng Lian;Xingjian Xu;Zhaohua Ji

  • Author_Institution
    College of Computer and Information Engineering, Inner Mongolia Normal University, Hohhot 010010, China
  • fYear
    2015
  • Firstpage
    1010
  • Lastpage
    1013
  • Abstract
    With the advent of the computer science, the data volume that needed to be processed under many practical situations increases dramatically, challenging many traditional machine learning techniques. Bearing this in mind, we made an intensive study on the optimization of decision tree algorithm and its corresponding porting to the big data analysis in this paper. An optimized genetic algorithm is merged into the implementation of the decision tree algorithm above, and we also invent a parallel genetic decision tree algorithm using MapReduce, which is very suitable for analyzing big data in cloud computing environment. Experiment results show that our algorithm acquires a nearly linear speedup, keeping a similar classification accuracy at the same time.
  • Keywords
    "Decision trees","Algorithm design and analysis","Classification algorithms","Optimization","Genetic algorithms","Genetics","Cloud computing"
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering and Service Science (ICSESS), 2015 6th IEEE International Conference on
  • ISSN
    2327-0586
  • Print_ISBN
    978-1-4799-8352-0
  • Electronic_ISBN
    2327-0594
  • Type

    conf

  • DOI
    10.1109/ICSESS.2015.7339225
  • Filename
    7339225