Title :
Evaluation of performance metrics for histopathological image classifier optimization
Author :
Zachariah, Nishant ; Kothari, Sonal ; Ramamurthy, Senthil ; Osunkoya, Adeboye O. ; Wang, May Dongmei
Author_Institution :
Dept. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
Abstract :
Clinical decision support systems use image processing and machine learning methods to objectively predict cancer in histopathological images. Integral to the development of machine learning classifiers is the ability to generalize from training data to unseen future data. A classification model's ability to accurately predict class labels for new, unseen data is measured by performance metrics, which also inform the classifier model selection process. Based on our research, metrics commonly used in the literature (such as accuracy and the ROC curve) do not accurately reflect the trained model's robustness. To the best of our knowledge, no research has been conducted to quantitatively compare performance metrics in the context of cancer prediction in histopathological images. In this paper, we evaluate various performance metrics and show that the Lift metric has the highest correlation between the internal and external validation sets of a nested cross-validation pipeline (R² = 0.57). Thus, we demonstrate that, among the 23 metrics evaluated, the Lift metric best generalizes classifier performance. Using the Lift metric, we develop a 4-class classifier with a misclassification rate of 0.25 on data that the model was not trained on (external validation).
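The comparison described in the abstract scores each metric by how well its internal-validation values track its external-validation values. The abstract does not say how R² was computed, so the following is only a minimal sketch that assumes R² is the squared Pearson correlation between paired scores. The function name `r_squared` and the example scores are hypothetical and are not taken from the paper.

```python
def r_squared(internal, external):
    """Squared Pearson correlation between paired score lists.

    Assumption: this is one common reading of "R2"; the paper's exact
    computation is not given in the abstract.
    """
    if len(internal) != len(external) or len(internal) < 2:
        raise ValueError("need two equal-length lists with at least 2 scores")
    n = len(internal)
    mean_i = sum(internal) / n
    mean_e = sum(external) / n
    # Sum of cross-deviations and sums of squared deviations.
    cov = sum((a - mean_i) * (b - mean_e) for a, b in zip(internal, external))
    var_i = sum((a - mean_i) ** 2 for a in internal)
    var_e = sum((b - mean_e) ** 2 for b in external)
    if var_i == 0 or var_e == 0:
        raise ValueError("scores must not be constant")
    return cov * cov / (var_i * var_e)


# Hypothetical values of one metric for five candidate models, measured on
# the internal and on the external validation sets (illustrative only).
internal_scores = [0.61, 0.70, 0.66, 0.74, 0.58]
external_scores = [0.59, 0.69, 0.60, 0.71, 0.62]
print(round(r_squared(internal_scores, external_scores), 2))
```

Under this reading, a metric whose internal scores predict its external scores well yields a value near 1, and repeating the calculation per metric would give a ranking of the kind the abstract reports.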
Keywords :
cancer; decision support systems; image classification; learning (artificial intelligence); medical image processing; optimisation; sensitivity analysis; 4-class classifier; Lift metric; ROC curve; cancer prediction; classification model ability; classifier model selection process; clinical decision support systems; external validation sets; histopathological image classifier optimization; image processing; internal validation sets; machine learning classifiers; misclassification rate; nested cross-validation pipeline; performance metrics evaluation; training data generalization; Cancer; Correlation; Feature extraction; Kernel; Robustness; Sensitivity;
Conference_Title :
Engineering in Medicine and Biology Society (EMBC), 2014 36th Annual International Conference of the IEEE
Conference_Location :
Chicago, IL
DOI :
10.1109/EMBC.2014.6943990