• DocumentCode
    1124460
  • Title

    A similarity learning approach to content-based image retrieval: application to digital mammography

  • Author

    El-Naqa, Issam ; Yang, Yongyi ; Galatsanos, Nikolas P. ; Nishikawa, Robert M. ; Wernick, Miles N.

  • Author_Institution
    Med. Sch. of Washington Univ., St. Louis, MO, USA
  • Volume
    23
  • Issue
    10
  • fYear
    2004
  • Firstpage
    1233
  • Lastpage
    1244
  • Abstract
    In this paper, we describe an approach to content-based retrieval of medical images from a database, and provide a preliminary demonstration of our approach as applied to retrieval of digital mammograms. Content-based image retrieval (CBIR) refers to the retrieval of images from a database using information derived from the images themselves, rather than solely from accompanying text indices. In the medical-imaging context, the ultimate aim of CBIR is to provide radiologists with a diagnostic aid in the form of a display of relevant past cases, along with proven pathology and other suitable information. CBIR may also be useful as a training tool for medical students and residents. The goal of information retrieval is to recall from a database information that is relevant to the user´s query. The most challenging aspect of CBIR is the definition of relevance (similarity), which is used to guide the retrieval machine. In this paper, we pursue a new approach, in which similarity is learned from training examples provided by human observers. Specifically, we explore the use of neural networks and support vector machines to predict the user´s notion of similarity. Within this framework we propose using a hierarchal learning approach, which consists of a cascade of a binary classifier and a regression module to optimize retrieval effectiveness and efficiency. We also explore how to incorporate online human interaction to achieve relevance feedback in this learning framework. Our experiments are based on a database consisting of 76 mammograms, all of which contain clustered microcalcifications (MCs). Our goal is to retrieve mammogram images containing similar MC clusters to that in a query. The performance of the retrieval system is evaluated using precision-recall curves computed using a cross-validation procedure. Our experimental results demonstrate that: 1) the learning framework can accurately predict the perceptual similarity reported by human observers, thereby se- - rving as a basis for CBIR; 2) the learning-based framework can significantly outperform a simple distance-based similarity metric; 3) the use of the hierarchical two-stage network can improve retrieval performance; and 4) relevance feedback can be effectively incorporated into this learning framework to achieve improvement in retrieval precision based on online interaction with users; and 5) the retrieved images by the network can have predicting value for the disease condition of the query.
  • Keywords
    content-based retrieval; diseases; image retrieval; learning (artificial intelligence); mammography; medical image processing; neural nets; optimisation; radiology; relevance feedback; support vector machines; clustered microcalcifications; content-based image retrieval; cross-validation procedure; digital mammography; disease condition; hierarchal learning approach; neural networks; online human interaction; optimisation; precision-recall curves; radiologists; regression module; relevance feedback; similarity learning approach; support vector machines; Biomedical imaging; Content based retrieval; Displays; Humans; Image databases; Image retrieval; Information retrieval; Mammography; Medical diagnostic imaging; Pathology; Algorithms; Artificial Intelligence; Breast Diseases; Calcinosis; Cluster Analysis; Computer Graphics; Computer Simulation; Database Management Systems; Decision Support Systems, Clinical; Female; Humans; Information Storage and Retrieval; Mammography; Numerical Analysis, Computer-Assisted; Pattern Recognition, Automated; Precancerous Conditions; Radiographic Image Enhancement; Radiographic Image Interpretation, Computer-Assisted; Reproducibility of Results; Sensitivity and Specificity; Signal Processing, Computer-Assisted; Subtraction Technique; User-Computer Interface;
  • fLanguage
    English
  • Journal_Title
    Medical Imaging, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0278-0062
  • Type

    jour

  • DOI
    10.1109/TMI.2004.834601
  • Filename
    1339430