• DocumentCode
    63295
  • Title

    MSIDX: Multi-Sort Indexing for Efficient Content-Based Image Search and Retrieval

  • Author

    Tiakas, E. ; Rafailidis, D. ; Dimou, Anastasia ; Daras, Petros

  • Author_Institution
    Inf. Technol. Inst., Centre for Res. & Technol. Hellas, Thessaloniki, Greece
  • Volume
    15
  • Issue
    6
  • fYear
    2013
  • fDate
    Oct. 2013
  • Firstpage
    1415
  • Lastpage
    1430
  • Abstract
    In this paper, a novel approximate indexing scheme for efficient content-based image search and retrieval is presented, called Multi-Sort Indexing (MSIDX). The proposed scheme analyzes high dimensional image descriptor vectors, by employing the value cardinalities of their dimensions. The dimensions´ value cardinalities, an inherent characteristic of descriptor vectors, are the number of discrete values in the dimensions. As expected, value cardinalities significantly vary, due to the existence of several extraction methods. Moreover, different quantization and normalization techniques used in the extraction process have a strong impact on the dimensions´ value cardinalities. Since dimensions with high value cardinalities have more discriminative power, a multiple sort algorithm is used to reorder the descriptors´ dimensions according to their value cardinalities, in order to increase the probability of two similar images to lie within a close constant range. The expected bounds of the constant range are defined in detail, following deterministic and probabilistic analyses. The proposed scheme is fully suitable (a) for real-time indexing of images, and (b) for searching and retrieving relevant images with an efficient query processing algorithm. In our experiments with five real datasets, we show the superiority of the proposed approach against hashing methods, also suitable for approximate similarity search.
  • Keywords
    approximation theory; content-based retrieval; feature extraction; image matching; image retrieval; indexing; probability; quantisation (signal); real-time systems; MSIDX; approximate indexing scheme; approximate similarity search; content-based image retrieval; content-based image search; descriptor dimensions; dimension value cardinalities; extraction process; hashing methods; high dimensional image descriptor vectors; multiple sort algorithm; multisort indexing; normalization techniques; probabilistic analysis; query processing algorithm; real-time image indexing; Accuracy; Approximation methods; Binary codes; Histograms; Indexing; Quantization; Vectors; Approximate similarity search; content-based image retrieval; indexing; multi-sort;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2013.2247989
  • Filename
    6466386