DocumentCode :
2607796
Title :
Linear model combining by optimizing the Area under the ROC curve
Author :
Tax, David M. J. ; Duin, Robert P. W. ; Arzhaeva, Y.
Author_Institution :
Delft Univ. of Technol.
Volume :
4
fYear :
0
fDate :
0-0 0
Firstpage :
119
Lastpage :
122
Abstract :
In some classification problems, like the detection of illnesses in patients, classes are very unbalanced and the misclassification costs for different classes vary significantly. Then it is better not to minimize the classification error, but to optimize the ordering of the data, or to optimize the area under the ROC curve (AUC). In this paper we propose to optimize a linear combination of features (or base model outputs) by optimizing AUC. The advantages are that a relatively small training set is required for the optimization and that the training set can have a large class imbalance. Furthermore, the classifier does not make distributional assumptions, making it very suitable to combine the outputs of base classifiers. In the application of the detection of interstitial lung diseases it is shown to be very advantageous and to outperform standard classification rules
Keywords :
pattern classification; sensitivity analysis; ROC curve; class imbalance; classification error; data ordering optimization; linear feature combination optimization; linear model; receiver operating characteristic; standard classification rule; training set; Area measurement; Biomedical imaging; Costs; Diseases; Electronic mail; Lungs; Pattern recognition; Proposals; Radiography; Vectors; area under the ROC curve; chest radiography; class imbalance; classifiers; combining; pattern recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
Conference_Location :
Hong Kong
ISSN :
1051-4651
Print_ISBN :
0-7695-2521-0
Type :
conf
DOI :
10.1109/ICPR.2006.766
Filename :
1699796
Link To Document :
بازگشت