DocumentCode :
3124947
Title :
An Analysis of Performance Measures for Binary Classifiers
Author :
Parker, Charles
Author_Institution :
BigML, Inc., Corvallis, OR, USA
fYear :
2011
fDate :
11-14 Dec. 2011
Firstpage :
517
Lastpage :
526
Abstract :
If one is given two binary classifiers and a set of test data, it should be straightforward to determine which of the two classifiers is the superior. Recent work, however, has called into question many of the methods heretofore accepted as standard for this task. In this paper, we analyze seven ways of determining if one classifier is better than another, given the same test data. Five of these are long established and two are relative newcomers. We review and extend work showing that one of these methods is clearly inappropriate, and then conduct an empirical analysis with a large number of datasets to evaluate the real-world implications of our theoretical analysis. Both our empirical and theoretical results converge strongly towards one of the newer methods.
Keywords :
data handling; pattern classification; set theory; binary classifiers; empirical analysis; performance measurement analysis; real-world implications; test data set; Accuracy; Communities; Equations; Loss measurement; Vectors; Weight measurement; Classifier Evaluation; Performance Metrics; Supervised Learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver,BC
ISSN :
1550-4786
Print_ISBN :
978-1-4577-2075-8
Type :
conf
DOI :
10.1109/ICDM.2011.21
Filename :
6137256
Link To Document :
بازگشت