• DocumentCode
    3784734
  • Title

    Unsupervised feature selection applied to content-based retrieval of lung images

  • Author

    J.G. Dy;C.E. Brodley;A. Kak;L.S. Broderick;A.M. Aisen

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Northeastern Univ., Boston, MA, USA
  • Volume
    25
  • Issue
    3
  • fYear
    2003
  • Firstpage
    373
  • Lastpage
    378
  • Abstract
    This paper describes a new hierarchical approach to content-based image retrieval called the "customized-queries" approach (CQA). Contrary to the single feature vector approach which tries to classify the query and retrieve similar images in one step, CQA uses multiple feature sets and a two-step approach to retrieval. The first step classifies the query according to the class labels of the images using the features that best discriminate the classes. The second step then retrieves the most similar images within the predicted class using the features customized to distinguish "subclasses" within that class. Needing to find the customized feature subset for each class led us to investigate feature selection for unsupervised learning. As a result, we developed a new algorithm called FSSEM (feature subset selection using expectation-maximization clustering). We applied our approach to a database of high resolution computed tomography lung images and show that CQA radically improves the retrieval precision over the single feature vector approach. To determine whether our CBIR system is helpful to physicians, we conducted an evaluation trial with eight radiologists. The results show that our system using CQA retrieval doubled the doctors´ diagnostic accuracy.
  • Keywords
    "Image retrieval","Content based retrieval","Lungs","Unsupervised learning","Clustering algorithms","Image databases","Spatial databases","Image resolution","Computed tomography","Information retrieval"
  • Journal_Title
    IEEE Transactions on Pattern Analysis and Machine Intelligence
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2003.1182100
  • Filename
    1182100