• DocumentCode
    3264454
  • Title

    Parameter estimation using B-trees

  • Author

    Schmidt, Albrecht ; Böhlen, Michael H.

  • Author_Institution
    Dept. of Comput. Sci., Aalborg Univ., Denmark
  • fYear
    2004
  • fDate
    7-9 July 2004
  • Firstpage
    325
  • Lastpage
    333
  • Abstract
    This work presents a method for accelerating algorithms for computing common statistical operations like parameter estimation or sampling on B-tree indexed data; the work was carried out in the context of visualisation of large scientific data sets. The underlying idea is the following: the shape of balanced data structures like B-trees encodes and reflects data semantics according to the balance criterion. For example, clusters in the index attribute are somewhat likely to be present not only on the data or leaf level of the tree but should propagate up into the interior levels. The paper also hints at opportunities and limitations of this approach for visualisation of large data sets. The advantages of the method are manifold. Not only does it enable advanced algorithms through a performance boost for basic operations like density estimation, but it also builds on functionality that is already present to a large degree in current RDBMSs. Additionally, it is fully dynamic and avoids redundancy: when the underlying source data change, the index and therefore the estimations adapt accordingly. Furthermore, we show that the sample quality is data-independent and that it can be modelled by a uniform sampling process if some basic prerequisites are ensured.
  • Keywords
    parameter estimation; relational databases; tree data structures; B-tree indexed data; data structures; parameter estimation; relational database management systems; Acceleration; Computer science; Data structures; Data visualization; Parameter estimation; Redundancy; Relational databases; Sampling methods; Shape; Technology management;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database Engineering and Applications Symposium, 2004. IDEAS '04. Proceedings. International
  • ISSN
    1098-8068
  • Print_ISBN
    0-7695-2168-1
  • Type

    conf

  • DOI
    10.1109/IDEAS.2004.1319806
  • Filename
    1319806