DocumentCode :
2199800
Title :
EBDA: An Effective Bottom-up Discretization Algorithm for Continuous Attributes
Author :
Sang, Yu ; Li, Keqiu ; Shen, Yanming
Author_Institution :
Dept. of Comput. Sci. & Eng., Dalian Univ. of Technol., Dalian, China
fYear :
2010
fDate :
June 29 2010-July 1 2010
Firstpage :
2455
Lastpage :
2462
Abstract :
Discretization algorithms have played an important role in many areas such as artificial intelligence, data mining and machine learning. In this paper, we propose an effective bottom-up discretization algorithm, namely EBDA. Firstly, we present a novel merging criterion which not only considers the effect of variance on degrees of freedom in the two merged intervals but also the effect of variance on interval difference and data distribution. In addition, we present a new stopping criterion with the aim to control the degree of misclassification and to merge intervals as many as possible. Detailed analysis shows that this algorithm can bring higher accuracy to the discretization process. Empirical experiments on 16 real data sets show that our proposed algorithm generates a better discretization scheme which significantly improves the accuracy of classification than existing algorithms by running C4.5.
Keywords :
data mining; learning (artificial intelligence); merging; Chi2 algorithm; artificial intelligence; continuous attributes; data mining; effective bottom-up discretization algorithm; machine learning; merging criterion; Accuracy; Algorithm design and analysis; Educational institutions; Machine learning algorithms; Merging; Upper bound; Chi2 algorithm; Classification; Discretization; Merging criterion; Stopping criterion;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Information Technology (CIT), 2010 IEEE 10th International Conference on
Conference_Location :
Bradford
Print_ISBN :
978-1-4244-7547-6
Type :
conf
DOI :
10.1109/CIT.2010.421
Filename :
5578282
Link To Document :
بازگشت