DocumentCode :
1991804
Title :
K-nearest neighbor and C4.5 algorithms as data mining methods: advantages and difficulties
Author :
Yahia, M.E. ; Ibrahim, B.A.
Author_Institution :
Dept. of Comput. Sci., Khartoum Univ., Sudan
fYear :
2003
fDate :
14-18 July 2003
Firstpage :
103
Abstract :
Summary form only given. Data mining is considered a fast growing technology as a result of the combination of some existing technologies such as machine learning, database systems, statistics and visualization. Some data mining algorithms has been used to offer a solution to classification problems in databases. To explain this task, comparison between the k-nearest neighbor (K-NN) and C4.5 algorithms in terms of their performance as classifier is carried out. While the K-NN is a supervised learning algorithm, C4.5 is an inductive learning algorithm. It is shown that the K-NN algorithm has the options for weight setting, normalization, editing the data and it can be used to develop hybrid systems for data mining. It is also shown the C4.5 algorithm can generate rules from a single tree with the ability to transform multiple decision trees into a set of classification rules and it can be used to better scale up rule generation in terms of size and number of rules and learning time.
Keywords :
classification; data mining; database management systems; decision trees; learning by example; C4 5 algorithms; K-NN; classification rule generation; data editing; data mining method; data normalization; data visualization; database systems; decision trees; hybrid systems; inductive learning; k-nearest neighbor; machine learning; supervised learning algorithm; Classification tree analysis; Data mining; Data visualization; Database systems; Decision trees; Machine learning; Machine learning algorithms; Statistics; Supervised learning; Visual databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Systems and Applications, 2003. Book of Abstracts. ACS/IEEE International Conference on
Conference_Location :
Tunis, Tunisia
Print_ISBN :
0-7803-7983-7
Type :
conf
DOI :
10.1109/AICCSA.2003.1227535
Filename :
1227535
Link To Document :
بازگشت