• DocumentCode
    3277576
  • Title

    Improved C4.5 decision tree algorithm based on sample selection

  • Author

    Fucai Chen ; Xiaowei Li ; Lixiong Liu

  • Author_Institution
    China Nat. Digital Switching Syst. Eng. Technol. R&D Center, Zhengzhou, China
  • fYear
    2013
  • fDate
    23-25 May 2013
  • Firstpage
    779
  • Lastpage
    782
  • Abstract
    To improve the classification accuracy and reduce the training time of large sample, and find the best training set, this paper proposes the improved C4.5 decision tree algorithm based on sample selection. The algorithm is based on the fact that decision tree can only get local optimal solution and has the bigger relativity with initial sample. In sample selection, we use iteration process to find the best training set. Using accuracy of the selected sample training as iteration Information is highly optimized for general use. Partition similarity is used for the best selection as the standard. Experiments show that the accuracy and time consumption of the proposed algorithm, aiming at classification and identification of large sample data, is better than C4.5 decision tree.
  • Keywords
    decision trees; pattern classification; classification accuracy improvement; improved C4.5 decision tree algorithm; iteration Information; iteration process; large sample data classification; large sample data identification; local optimal solution; partition similarity; sample selection; training time reduction; IP networks; Training; C4.5 decision tree; partition similarity; sample selection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering and Service Science (ICSESS), 2013 4th IEEE International Conference on
  • Conference_Location
    Beijing
  • ISSN
    2327-0586
  • Print_ISBN
    978-1-4673-4997-0
  • Type

    conf

  • DOI
    10.1109/ICSESS.2013.6615421
  • Filename
    6615421