• DocumentCode
    707608
  • Title

    Investigation of effect of reducing dataset´s size on classification algorithms

  • Author

    Singhal, Neelam ; Ashraf, Mohd

  • Author_Institution
    Sch. of ICT, Gautam Buddha Univ., Noida, India
  • fYear
    2015
  • fDate
    11-13 March 2015
  • Firstpage
    2036
  • Lastpage
    2040
  • Abstract
    Data mining is now one of the most active field of research. Extracting those nuggets of information is becoming crucial and one of its important technique is classification. It helps to group the data in some predefined classes. Various techniques for classification exists which classifies the data using different algorithms. Each algorithm has its own area of best and worst performance. This paper concentrates on the four most famous algorithms, i.e., Decision Tree, Naïve Bayes, K Nearest Neighbour and Genetic Programming and the effect on their performance of time and accuracy when the number of instances are incrementally decreased. This paper will also investigate the difference in result when working with binary class or multiclass datasets and suggest the algorithms to follow when using certain kind of dataset.
  • Keywords
    Bayes methods; data mining; decision trees; genetic algorithms; pattern classification; binary class datasets; classification algorithms; data mining; decision tree; genetic programming; k nearest neighbour; multiclass datasets; naïve Bayes; Accuracy; Breast cancer; Classification algorithms; Data mining; Decision trees; Genetic programming; Timing; Accuracy; Decision Tree; Genetic Programming; K-Nearest Neighbor; Naïve Bayes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing for Sustainable Global Development (INDIACom), 2015 2nd International Conference on
  • Conference_Location
    New Delhi
  • Print_ISBN
    978-9-3805-4415-1
  • Type

    conf

  • Filename
    7100598