Title :
Max-AUC Feature Selection in Computer-Aided Detection of Polyps in CT Colonography
Author :
Jian-Wu Xu ; Suzuki, Kenji
Author_Institution :
Dept. of Radiol., Univ. of Chicago, Chicago, IL, USA
Abstract :
We propose a feature selection method based on a sequential forward floating selection (SFFS) procedure to improve the performance of a classifier in computerized detection of polyps in CT colonography (CTC). The feature selection method is coupled with a nonlinear support vector machine (SVM) classifier. Unlike the conventional linear method based on Wilks´ lambda, the proposed method selected the most relevant features that would maximize the area under the receiver operating characteristic curve (AUC), which directly maximizes classification performance, evaluated based on AUC value, in the computer-aided detection (CADe) scheme. We presented two variants of the proposed method with different stopping criteria used in the SFFS procedure. The first variant searched all feature combinations allowed in the SFFS procedure and selected the subsets that maximize the AUC values. The second variant performed a statistical test at each step during the SFFS procedure, and it was terminated if the increase in the AUC value was not statistically significant. The advantage of the second variant is its lower computational cost. To test the performance of the proposed method, we compared it against the popular stepwise feature selection method based on Wilks´ lambda for a colonic-polyp database (25 polyps and 2624 nonpolyps). We extracted 75 morphologic, gray-level-based, and texture features from the segmented lesion candidate regions. The two variants of the proposed feature selection method chose 29 and 7 features, respectively. Two SVM classifiers trained with these selected features yielded a 96% by-polyp sensitivity at false-positive (FP) rates of 4.1 and 6.5 per patient, respectively. Experiments showed a significant improvement in the performance of the classifier with the proposed feature selection method over that with the popular stepwise feature selection based on Wilks´ lambda that yielded 18.0 FPs per patient at the same sensitivity level.
Keywords :
biological tissues; computer aided analysis; computerised tomography; feature extraction; feature selection; image classification; image segmentation; image texture; medical image processing; sensitivity analysis; statistical analysis; support vector machines; AUC values; CADe; CTC; CTcolonography; SFFS procedure; SVM classifier; Wilks´ lambda; area under the receiver operating characteristic curve; by-polyp sensitivity; classification performance; colonic-polyp database; computational cost; computer-aided detection scheme; conventional linear method; false-positive rates; feature combinations; gray-level-based features; max-AUC feature selection; morphologic features; nonlinear support vector machine classifier; popular stepwise feature selection; segmented lesion candidate regions; sensitivity level; sequential forward floating selection procedure; statistical test; stepwise feature selection method; stopping criteria; texture features; Colon; Databases; Feature extraction; Kernel; Lesions; Sensitivity; Support vector machines; Colonic polyps; computer-aided detection (CADe); feature selection; support vector machines (SVMs);
Journal_Title :
Biomedical and Health Informatics, IEEE Journal of
DOI :
10.1109/JBHI.2013.2278023