Title :
Classification of unbalanced data with transparent kernels
Author :
Lee, K.K. ; Gunn, S.R. ; Harris, C.J. ; Reed, P.A.S.
Author_Institution :
Dept. of Electron. & Comput. Sci., Southampton Univ., UK
Abstract :
Two important issues regarding data driven classification are addressed here: model interpretation and unbalanced data. The aim is to build data driven classifiers that provide good predictive performance for a set of unbalanced data and enhance the understanding of a model by enabling input/output dependencies that exist to be visualised. The classification method is demonstrated on an unbalanced data set that describes fatigue crack initiation in automotive camshafts. To generate interpretable models, the support vector parsimonious analysis of variance technique is extended to the classification domain. The technique enables an additive decomposition of low dimensional kernel models to be recovered, enhancing model visualization. The standard averaging technique used to assess the performance of the model is inappropriate for unbalanced data. The geometric mean is used. These resulting components had low dimensions, and consequently can be visualized
Keywords :
data visualisation; fatigue cracks; learning automata; mechanical engineering computing; neural nets; pattern classification; automotive camshafts; data visualisation; fatigue crack; geometric mean; model interpretation; support vector machine; transparent kernels; unbalanced data classification; Automotive engineering; Camshafts; Costs; Data visualization; Fatigue; Kernel; Microstructure; Predictive models; Support vector machine classification; Support vector machines;
Conference_Titel :
Neural Networks, 2001. Proceedings. IJCNN '01. International Joint Conference on
Conference_Location :
Washington, DC
Print_ISBN :
0-7803-7044-9
DOI :
10.1109/IJCNN.2001.938744