• DocumentCode
    2535945
  • Title

    XLeaf: Twig Evaluation with Skipping Loop Joins and Virtual Nodes

  • Author

    Grimsmo, Nils ; Bjørklund, Truls A ; Torbjørnsen, Øystein

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Norwegian Univ. of Sci. & Technol. Trondheim, Trondheim, Norway
  • fYear
    2010
  • fDate
    11-16 April 2010
  • Firstpage
    204
  • Lastpage
    213
  • Abstract
    XML indexing and search has become an important topic, and twig joins are key building blocks in XML search systems. This paper describes a novel approach using a nested loop twig join algorithm, which combines several existing techniques to speed up evaluation of XML queries. We combine structural summaries, path indexing and prefix path partitioning to reduce the amount of data read by the join. This effect is amplified by only reading data for leaf query nodes, and inferring data for internal nodes from the structural summary. Skipping is used to speed up merges where query leaves have differing selectivity. Multiple access methods are implemented as materialized views instead of succinct secondary indexes for better locality. This redundancy is made affordable in terms of space by using compression in a back-end with columnar storage. We have implemented an experimental prototype, which shows a speedup of two orders of magnitude on XPath queries with value predicates, when compared to existing open source and commercial systems using a subset of the techniques. Space usage is also improved.
  • Keywords
    XML; indexing; XLeaf; XML indexing; XML queries; XML search systems; multiple access methods; nested loop twig join algorithm; path indexing; prefix path partitioning; structural summaries; twig evaluation; virtual nodes; Application software; Books; Databases; Indexing; Information science; Internet; Partitioning algorithms; Prototypes; Query processing; XML; XML; column store; query processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advances in Databases Knowledge and Data Applications (DBKDA), 2010 Second International Conference on
  • Conference_Location
    Menuires
  • Print_ISBN
    978-1-4244-6081-6
  • Type

    conf

  • DOI
    10.1109/DBKDA.2010.8
  • Filename
    5477123