Title :
Automatically balancing accuracy and comprehensibility in predictive modeling
Author :
Johansson, Ulf ; König, Rikard ; Niklasson, Lars
Author_Institution :
Sch. of Bus. & Informatics, Univ. of Boras, Sweden
Abstract :
One specific problem, when performing predictive modeling, is the tradeoff between accuracy and comprehensibility. When comprehensible models are required, this normally rules out high-accuracy techniques like neural networks and committee machines. Therefore, an automated choice of a standard technique, known to generally produce sufficiently accurate and comprehensible models, would be of great value. In this paper, it is argued that this requirement is met by an ensemble of classifiers, followed by rule extraction. The proposed technique is demonstrated, using an ensemble of common classifiers and our rule extraction algorithm G-REX, on 17 publicly available data sets. The results presented demonstrate that the suggested technique performs very well. More specifically, the ensemble clearly outperforms the individual classifiers regarding accuracy, while the extracted models have accuracy similar to the individual classifiers. The extracted models are, however, significantly more compact than corresponding models created directly from the data set using the standard tool CART; thus providing higher comprehensibility.
Keywords :
data mining; knowledge based systems; neural nets; G-REX; automatically balancing accuracy; committee machine; comprehensible model; data mining; neural network; predictive modeling; public available data set; rule extraction algorithm; standard tool CART; Artificial neural networks; Data mining; Fusion power generation; Informatics; Information analysis; Linear regression; Neural networks; Power generation; Predictive models; Training data; Accuracy; Committee Machines; Comprehensibility; Data Mining; Rule Extraction;
Conference_Titel :
Information Fusion, 2005 8th International Conference on
Print_ISBN :
0-7803-9286-8
DOI :
10.1109/ICIF.2005.1592040