• Title of article

    Probabilistic enhancement of approximate indexing in metric spaces

  • Author/Authors

    Takao Murakami، نويسنده , , Kenta Takahashi، نويسنده , , Susumu Serita، نويسنده , , Yasuhiro Fujii، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2013
  • Pages
    12
  • From page
    1007
  • To page
    1018
  • Abstract
    Some approximate indexing schemes have been recently proposed in metric spaces which sort the objects in the database according to pseudo-scores. It is known that (1) some of them provide a very good trade-off between response time and accuracy, and (2) probability-based pseudo-scores can provide an optimal trade-off in range queries if the probabilities are correctly estimated. Based on these facts, we propose a probabilistic enhancement scheme which can be applied to any pseudo-score based scheme. Our scheme computes probability-based pseudo-scores using pseudo-scores obtained from a pseudo-score based scheme. In order to estimate the probability-based pseudo-scores, we use the object-specific parameters in logistic regression and learn the parameters using MAP (Maximum a Posteriori) estimation and the empirical Bayes method. We also propose a technique which speeds up learning the parameters using pseudo-scores. We applied our scheme to the two state-of-the-art schemes: the standard pivot-based scheme and the permutation-based scheme, and evaluated them using various kinds of datasets from the Metric Space Library. The results showed that our scheme outperformed the conventional schemes, with regard to both the number of distance computations and the CPU time, in all the datasets.
  • Keywords
    logistic regression , Maximum a posteriori estimation , Similar training object , Metric space indexing , Approximate indexing , Pseudo-score , Probabilistic enhancement , Empirical Bayes method
  • Journal title
    Information Systems
  • Serial Year
    2013
  • Journal title
    Information Systems
  • Record number

    1230341