DocumentCode :
1797634
Title :
A classifier-based association test for imbalanced data derived from prediction theory
Author :
Mohr, J. ; Seo, S. ; Obermayer, Klaus
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Tech. Univ. Berlin, Berlin, Germany
fYear :
2014
fDate :
6-11 July 2014
Firstpage :
487
Lastpage :
493
Abstract :
How can we test for group differences in multidimensional input patterns, such as functional magnetic resonance imaging measurements or gene expression values? One solution is to split the available data into training and test set, and to estimate the generalization accuracy of a classifier that predicts the group variable from the input pattern. If this lies significantly above chance level, we can reject the null hypothesis of no association. This test is straightforward for balanced data, where all groups are equally frequent in the data set. However, data sets collected in observational studies are often imbalanced. Then accuracy is no longer a suitable measure of performance, and balanced accuracy should be used instead. In this paper, we give an overview on existing analytical tests and use the framework of prediction theory to derive a new test for the balanced accuracy of a classifier. We then use numerical simulations to evaluate the type I error rate and the power of two tests for imbalanced data.
Keywords :
numerical analysis; pattern classification; prediction theory; analytical test; classifier generalization accuracy; classifier-based association test; imbalanced data; multidimensional input patterns; numerical simulations; prediction theory; test set; training set; type I error rate evaluation; Accuracy; Bayes methods; Error analysis; Noise level; Prediction theory; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks (IJCNN), 2014 International Joint Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-6627-1
Type :
conf
DOI :
10.1109/IJCNN.2014.6889547
Filename :
6889547
Link To Document :
بازگشت