• DocumentCode
    1991804
  • Title

    K-nearest neighbor and C4.5 algorithms as data mining methods: advantages and difficulties

  • Author

    Yahia, M.E. ; Ibrahim, B.A.

  • Author_Institution
    Dept. of Comput. Sci., Khartoum Univ., Sudan
  • fYear
    2003
  • fDate
    14-18 July 2003
  • Firstpage
    103
  • Abstract
    Summary form only given. Data mining is considered a fast growing technology as a result of the combination of some existing technologies such as machine learning, database systems, statistics and visualization. Some data mining algorithms has been used to offer a solution to classification problems in databases. To explain this task, comparison between the k-nearest neighbor (K-NN) and C4.5 algorithms in terms of their performance as classifier is carried out. While the K-NN is a supervised learning algorithm, C4.5 is an inductive learning algorithm. It is shown that the K-NN algorithm has the options for weight setting, normalization, editing the data and it can be used to develop hybrid systems for data mining. It is also shown the C4.5 algorithm can generate rules from a single tree with the ability to transform multiple decision trees into a set of classification rules and it can be used to better scale up rule generation in terms of size and number of rules and learning time.
  • Keywords
    classification; data mining; database management systems; decision trees; learning by example; C4 5 algorithms; K-NN; classification rule generation; data editing; data mining method; data normalization; data visualization; database systems; decision trees; hybrid systems; inductive learning; k-nearest neighbor; machine learning; supervised learning algorithm; Classification tree analysis; Data mining; Data visualization; Database systems; Decision trees; Machine learning; Machine learning algorithms; Statistics; Supervised learning; Visual databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Systems and Applications, 2003. Book of Abstracts. ACS/IEEE International Conference on
  • Conference_Location
    Tunis, Tunisia
  • Print_ISBN
    0-7803-7983-7
  • Type

    conf

  • DOI
    10.1109/AICCSA.2003.1227535
  • Filename
    1227535