• DocumentCode
    2457223
  • Title
    Branch Code: A Labeling Scheme for Efficient Query Answering on Trees
  • Author
    Xiao, Yanghua ; Hong, Ji ; Cui, Wanyun ; He, Zhenying ; Wang, Wei ; Feng, Guodong
  • Author_Institution
    Sch. of Comput. Sci., Fudan Univ., Shanghai, China
  • fYear
    2012
  • fDate
    1-5 April 2012
  • Firstpage
    654
  • Lastpage
    665
  • Abstract
    Labeling schemes lie at the core of query processing for tree-structured data such as the XML data that is flooding the web. A labeling scheme that simultaneously and efficiently supports relationship queries on trees (such as parent/child and ancestor/descendant), computation of lowest common ancestors (LCA), and tree updates is desirable for effective and efficient management of tree-structured data. Although a variety of labeling schemes, such as prefix-based, interval-based, and prime-based labeling and their variants, are available for encoding static and dynamic trees, each usually shows weakness in one aspect or another. In this paper, we propose an integer-based labeling scheme, branch code, together with a compressed version, as our main solution for efficient query processing on both static and dynamic ordered trees at affordable storage cost. Branch code answers common queries on ordered trees in constant time, at the cost of O(N log N) storage. To reduce the storage cost to O(N), we further develop a compressed branch code. We also give a relationship determination algorithm that uses only the compressed branch code and, as experiments verify, has a low probability of producing false positives. With the support of splay trees, branch code also handles dynamic trees, so that updates and queries can be implemented with O(log N) amortized cost. All of the results above are either theoretically proved or verified by experimental studies.
  • Keywords
    XML; computational complexity; query processing; storage management; tree data structures; LCA; World Wide Web; XML data; amortized cost; compressed branch code; dynamic ordered trees; encoding; integer-based labeling scheme; interval-based labeling; lowest common ancestor; prefix-based labeling; prime-based labeling; query answering; relationship determination algorithm; relationship queries; splay trees; static ordered trees; storage cost; tree-structured data management; Encoding; Equations; Labeling; Mathematical model; Query processing; Vegetation
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Engineering (ICDE), 2012 IEEE 28th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1063-6382
  • Print_ISBN
    978-1-4673-0042-1
  • Type
    conf
  • DOI
    10.1109/ICDE.2012.71
  • Filename
    6228122