Title :
Precision-recall operating characteristic (P-ROC) curves in imprecise environments
Author :
Landgrebe, T.C.W. ; Paclik, P. ; Duin, Robert P. W. ; Bradley, Andrew P.
Author_Institution :
Elect. Eng., Maths & Comp. Sc., Delft Univ. of Technol.
Abstract :
Traditionally, machine learning algorithms have been evaluated in applications where assumptions can be reliably made about class priors and/or misclassification costs. In this paper, we consider the case of imprecise environments, where little may be known about these factors and they may well vary significantly when the system is applied. Specifically, the use of precision-recall analysis is investigated and compared to the more well known performance measures such as error-rate and the receiver operating characteristic (ROC). We argue that while ROC analysis is invariant to variations in class priors, this invariance in fact hides an important factor of the evaluation in imprecise environments. Therefore, we develop a generalised precision-recall analysis methodology in which variation due to prior class probabilities is incorporated into a multi-way analysis of variance (ANOVA). The increased sensitivity and reliability of this approach is demonstrated in a remote sensing application
Keywords :
learning (artificial intelligence); pattern classification; sensitivity analysis; statistical analysis; ANOVA; ROC curve analysis; imprecise environment; machine learning; misclassification cost; precision-recall analysis; precision-recall operating characteristic curve; receiver operating characteristic; variance analysis; Analysis of variance; Area measurement; Australia; Cost function; Machine learning algorithms; Pattern recognition; Performance analysis; Remote sensing; Sampling methods; Surfaces;
Conference_Titel :
Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2521-0
DOI :
10.1109/ICPR.2006.941