Title :
Discretization algorithm based on difference-similitude set theory
Author :
Wu, Ming ; Huang, Xiao-Chun ; Luo, Xin ; Pu-Liu Van
Author_Institution :
Sch. of Electron. Inf., Wuhan Univ., China
Abstract :
A new discretization algorithm based on difference-similitude set theory (DSST) is presented. It is different from the known algorithms because the reduction in the information system is prior to the data discretization. Our algorithm discretizes the attribute values by analyzing difference sets of the newly-built decision table. The discretization is separated into three steps. The first step is to reduce the information system without any discretization. The second step is to rebuild a new decision table according to the reduction result obtained in the first step. The third step is to discretize the attribute values by analyzing the difference sets of the newly-built decision table. We applied our algorithm in UCI databases and compare our method with other algorithms at the end of the paper. The experiments show that it is an effective algorithm.
Keywords :
data reduction; decision tables; information systems; knowledge based systems; set theory; decision table; difference set; discretization algorithm; information system; knowledge reduction; similitude set; Algorithm design and analysis; Clustering algorithms; Cybernetics; Database systems; Decision support systems; Information systems; Internet; Machine learning; Set theory; Discretization; difference set; information system; knowledge reduction; similitude set;
Conference_Titel :
Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
Conference_Location :
Guangzhou, China
Print_ISBN :
0-7803-9091-1
DOI :
10.1109/ICMLC.2005.1527228