Title :
Design and analysis of a content-based pathology image retrieval system
Author :
Zheng, Lei ; Wetzel, Arthur W. ; Gilbertson, John ; Becich, Michael J.
Author_Institution :
Dept. of Pathology, Pittsburgh Univ., PA, USA
Abstract :
A prototype, content-based image retrieval system has been built employing a client/server architecture to access supercomputing power from the physician´s desktop. The system retrieves images and their associated annotations from a networked microscopic pathology image database based on content similarity to user supplied query images. Similarity is evaluated based on four image feature types: color histogram, image texture, Fourier coefficients, and wavelet coefficients, using the vector dot product as a distance metric. Current retrieval accuracy varies across pathological categories depending on the number of available training samples and the effectiveness of the feature set. The distance measure of the search algorithm was validated by agglomerative cluster analysis in light of the medical domain knowledge. Results show a correlation between pathological significance and the image document distance value generated by the computer algorithm. This correlation agrees with observed visual similarity. This validation method has an advantage over traditional statistical evaluation methods when sample size is small and where domain knowledge is important. A multi-dimensional scaling analysis shows a low dimensionality nature of the embedded space for the current test set.
Keywords :
PACS; diseases; image retrieval; image texture; statistical analysis; Fourier coefficients; client/server architecture; cluster analysis; color histogram; computer algorithm; content similarity; content-based pathology image retrieval system; feature set; image document distance value; image feature types; image texture; medical domain knowledge; microscopic pathology image database; multi-dimensional scaling analysis; pathological significance; percomputing power; query images; retrieval accuracy; visual similarity; wavelet coefficients; Clustering algorithms; Content based retrieval; Image analysis; Image databases; Image retrieval; Information retrieval; Microscopy; Network servers; Pathology; Prototypes; Abstracting and Indexing as Topic; Algorithms; Computer Graphics; Computer Simulation; Database Management Systems; Diagnosis, Computer-Assisted; Documentation; Equipment Design; Equipment Failure Analysis; Image Enhancement; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Medical Records Systems, Computerized; Microscopy; Software; Software Design; Systems Integration; Telepathology; User-Computer Interface;
Journal_Title :
Information Technology in Biomedicine, IEEE Transactions on
DOI :
10.1109/TITB.2003.822952