DocumentCode
2033649
Title
Rough set and CART approaches to mining incomplete data
Author
Grzymala-Busse, Jerzy W.
Author_Institution
Dept. of Electr. Eng. & Comput. Sci., Univ. of Kansas, Lawrence, KS, USA
fYear
2010
fDate
7-10 Dec. 2010
Firstpage
214
Lastpage
219
Abstract
Many data sets are incomplete, i.e., are affected by missing attribute values. In this paper, we report results of experiments on two approaches to missing attribute values. The first one is based on rough set theory and rule induction, the second one is the CART method that uses surrogate splits for handling missing attribute values and that generates decision trees. As follows from our experiments, both approaches are comparable in terms of an error rate. Thus, for a specific data set the best method of handling missing attribute values should be selected individually.
Keywords
data mining; decision trees; rough set theory; CART method; data mining; decision tree generation; missing attribute values; rough set theory; rule induction; Approximation methods; Breast cancer; Data mining; Decision trees; Error analysis; Image segmentation; CART algorithm for decision tree generation; LERS data mining system; MLEM2 algorithm for rule induction; incomplete data sets; missing attribute values;
fLanguage
English
Publisher
ieee
Conference_Titel
Soft Computing and Pattern Recognition (SoCPaR), 2010 International Conference of
Conference_Location
Paris
Print_ISBN
978-1-4244-7897-2
Type
conf
DOI
10.1109/SOCPAR.2010.5685860
Filename
5685860
Link To Document