DocumentCode :
1565935
Title :
Multivariate interdependent discretization for continuous attribute
Author :
Chao, Sam ; Li, Yiping
Author_Institution :
Fac. of Sci. & Technol., Macau Univ., China
Volume :
1
fYear :
2005
Firstpage :
167
Abstract :
Decision tree is one of the most widely used and practical methods in the data mining and machine learning discipline. However, many discretization algorithms developed in this field focus on univariate only, which is inadequate to handle the critical problems especially owned by medical domain. In this paper, we propose a new multivariate discretization method called multivariate interdependent discretization for continuous attributes - MIDCA. Our novel algorithm can minimize the uncertainty between the interdependent attribute and the continuous-valued attribute, and at the same time to maximize their correlation. The empirical results demonstrate a comparison of performance of various decision tree algorithms on twelve real-life datasets from UCI repository.
Keywords :
data mining; decision trees; learning (artificial intelligence); UCI repository; continuous-valued attribute; data mining; decision tree algorithms; interdependent attribute; machine learning; multivariate interdependent discretization; Aging; Bayesian methods; Blood pressure; Chaos; Data mining; Decision trees; Hypertension; Inference algorithms; Machine learning; Machine learning algorithms; Correlated Attribute; Data Mining; Interdependent; Machine Learning; Multivariate Discretization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Technology and Applications, 2005. ICITA 2005. Third International Conference on
Print_ISBN :
0-7695-2316-1
Type :
conf
DOI :
10.1109/ICITA.2005.188
Filename :
1488790
Link To Document :
بازگشت