DocumentCode
3224810
Title
Automatically balancing accuracy and comprehensibility in predictive modeling
Author
Johansson, Ulf ; König, Rikard ; Niklasson, Lars
Author_Institution
Sch. of Bus. & Informatics, Univ. of Boras, Sweden
Volume
2
fYear
2005
fDate
25-28 July 2005
Abstract
One specific problem, when performing predictive modeling, is the tradeoff between accuracy and comprehensibility. When comprehensible models are required, this normally rules out high-accuracy techniques like neural networks and committee machines. Therefore, an automated choice of a standard technique, known to generally produce sufficiently accurate and comprehensible models, would be of great value. In this paper, it is argued that this requirement is met by an ensemble of classifiers, followed by rule extraction. The proposed technique is demonstrated, using an ensemble of common classifiers and our rule extraction algorithm G-REX, on 17 publicly available data sets. The results presented demonstrate that the suggested technique performs very well. More specifically, the ensemble clearly outperforms the individual classifiers regarding accuracy, while the extracted models have accuracy similar to the individual classifiers. The extracted models are, however, significantly more compact than corresponding models created directly from the data set using the standard tool CART; thus providing higher comprehensibility.
Keywords
data mining; knowledge based systems; neural nets; G-REX; automatically balancing accuracy; committee machine; comprehensible model; data mining; neural network; predictive modeling; public available data set; rule extraction algorithm; standard tool CART; Artificial neural networks; Data mining; Fusion power generation; Informatics; Information analysis; Linear regression; Neural networks; Power generation; Predictive models; Training data; Accuracy; Committee Machines; Comprehensibility; Data Mining; Rule Extraction;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Fusion, 2005 8th International Conference on
Print_ISBN
0-7803-9286-8
Type
conf
DOI
10.1109/ICIF.2005.1592040
Filename
1592040
Link To Document