• DocumentCode
    755443
  • Title

    Probe Minimization by Schedule Optimization: Supporting Top-K Queries with Expensive Predicates

  • Author

    Hwang, Seung-Won ; Chang, Kevin Chen-Chuan

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Pohang Univ. of Sci. & Technol.
  • Volume
    19
  • Issue
    5
  • fYear
    2007
  • fDate
    5/1/2007 12:00:00 AM
  • Firstpage
    646
  • Lastpage
    662
  • Abstract
    This paper addresses the problem of evaluating ranked top-k queries with expensive predicates. As major DBMSs now all support expensive user-defined predicates for Boolean queries, we believe such support for ranked queries can be even more important: first, ranked queries often need to model user-specific concepts of preference, relevance, or similarity, which call for dynamic user-defined functions. Second, middleware systems must incorporate external predicates for integrating autonomous sources typically accessible only by per-object queries. Third, ranked queries often accompany Boolean ranking conditions, which may turn predicates into expensive ones, as the index structure on the predicate built on the base table may be no longer effective in retrieving the filtered objects in order. Fourth, fuzzy joins are inherently expensive, as they are essentially user-defined operations that dynamically associate multiple relations. These predicates, being dynamically defined or externally accessed, cannot rely on index mechanisms to provide zero-time sorted output, and must instead require per-object probe to evaluate. To enable probe minimization, we develop the problem as cost-based optimization of searching over potential probe schedules. In particular, we decouple probe scheduling into object and predicate scheduling problems and develop an analytical object scheduling optimization and a dynamic predicate scheduling optimization, which combined together form a cost-effective probe schedule
  • Keywords
    database management systems; middleware; minimisation; query processing; scheduling; Boolean query; DBMS; database management system; index structure; middleware system; probe minimization; schedule optimization; top-k query; user-defined predicate; Computer science; Database systems; Distributed information systems; Dynamic scheduling; Image retrieval; Information retrieval; Middleware; Optimal scheduling; Probes; Query processing; Database query processing; database systems.; distributed information systems;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2007.1007
  • Filename
    4138202