• DocumentCode
    2120992
  • Title

    The Holodex: Integrating Summarization with the Index

  • Author

    Yang, Hung-chih ; Parker, D. Stott ; Hsiao, Ruey-Lung

  • Author_Institution
    CSD, UCLA, Los Angeles, CA
  • fYear
    2006
  • fDate
    0-0 0
  • Firstpage
    23
  • Lastpage
    32
  • Abstract
    In this paper we introduce the Holodex, a 'holistic index' for databases that includes a facility for statistics and aggregate-like computations. The Holodex is an integration of the conventional index and summarization over traversals of the index. It can store customized summaries in its data structure, and in this way it can maintain, and provide fast access to, summarized information. The Holodex rests on the summary-traversal architecture - a customizable summarization scheme for tree indexes. An important property of the summary-traversal architecture is that index structures defining an ordering on data can be augmented to provide extra summary information as well. For example, both tree indexes (such as the B+-tree) and tree-hash hybrids (e.g., multi-level trie hashing and interpolation search tree) define an ordering, and they can be naturally extended to include summary information. This combination of indexing and summarization has a variety of uses, including computation of aggregate functions, rollups, bulk computation, and a variety of kinds of statistics, particularly those that are in some way related to order. More specifically, it is useful for computing non-parametric statistics - including rank statistics and order statistics - as well as direct implementation of queries like basic statistical tests on sample distributions.
  • Keywords
    database indexing; nonparametric statistics; tree data structures; Holodex; data structure; holistic database index; sample distributions; statistical tests; summarization scheme; summary-traversal architecture; tree indexes; tree-hash hybrids; Aggregates; Computer architecture; Data structures; Databases; Indexes; Indexing; Interpolation; Statistical analysis; Statistical distributions; Statistics
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Scientific and Statistical Database Management, 2006. 18th International Conference on
  • Conference_Location
    Vienna
  • ISSN
    1551-6393
  • Print_ISBN
    0-7695-2590-3
  • Type

    conf

  • DOI
    10.1109/SSDBM.2006.52
  • Filename
    1644294