DocumentCode :
3512316
Title :
Boosted Spectral Embedding (BoSE): Applications to content-based image retrieval of histopathology
Author :
Sridhar, Akshay ; Doyle, Scott ; Madabhushi, Anant
Author_Institution :
Dept. of Biomed. Eng., Rutgers Univ., Piscataway, NJ, USA
fYear :
2011
fDate :
March 30 2011-April 2 2011
Firstpage :
1897
Lastpage :
1900
Abstract :
In machine learning, non-linear dimensionality reduction (NLDR) is commonly used to embed high-dimensional data into a low-dimensional space while preserving local object adjacencies. However, the majority of NLDR methods define object adjacencies using distance metrics that do not account for the quality of the features in the high-dimensional space. In this paper we present Boosted Spectral Embedding (BoSE), a variant of the traditional Spectral Embedding (SE) that utilizes a Boosted Distance Metric (BDM) to improve the low-dimensional representation of the data. Under the naive assumption that all features are equally important, SE uses the Euclidean distance metric to define object-distance relationships. However, the BDM selectively weights features via the AdaBoost algorithm such that the low-dimensional representation contains only the most discriminating features. In this work BoSE is evaluated against SE in the context of digitized histopathology images using (a) content-based image retrieval and (b) classification via Random Forest of the low-dimensional representation. Using images from a cohort of 58 prostate cancer patient studies, BoSE and SE separated benign and malignant samples with areas under the precision-recall curve (AUPRCs) of 0.95 and 0.67 and classification accuracies using a Random Forest (RF) classifer were 0.93 and 0.79, respectively. For a cohort of 55 breast cancer studies, BoSE and SE performed comparably in terms of both RF accuracy and AUPRC. In addition, a qualitative visualization of the low-dimensional data representations suggests that BoSE exhibits improved class separability over SE.
Keywords :
cancer; content-based retrieval; feature extraction; image classification; image representation; image retrieval; learning (artificial intelligence); medical image processing; tumours; AUPRC; AdaBoost algorithm; Euclidean distance metric; Random Forest classification; benign samples; boosted distance metric; boosted spectral embedding; class separability; content-based image retrieval; histopathology; low-dimensional representation; machine learning; malignant samples; nonlinear dimensionality reduction; object-distance relationships; precision-recall curve; prostate cancer; Accuracy; Breast cancer; Feature extraction; Measurement; Prostate cancer; Radio frequency; BoSE; boosting; breast cancer; content-based image retrieval; histopathology; prostate cancer; spectral embedding;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Biomedical Imaging: From Nano to Macro, 2011 IEEE International Symposium on
Conference_Location :
Chicago, IL
ISSN :
1945-7928
Print_ISBN :
978-1-4244-4127-3
Electronic_ISBN :
1945-7928
Type :
conf
DOI :
10.1109/ISBI.2011.5872779
Filename :
5872779
Link To Document :
بازگشت