• DocumentCode
    2087478
  • Title

    On extending the XML engine with query-processing capabilities

  • Author

    Böhm, Klemens

  • Author_Institution
    Database Res. Group, Swiss Federal Inst. of Technol., Zurich, Switzerland
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    127
  • Lastpage
    138
  • Abstract
    We study how to efficiently evaluate queries over XML documents that are represented according to the XML specification, i.e., as XML files. The software architecture is as follows: the XML engine (i.e., the XML parser) makes the structure of the documents explicit, and the query processor operates directly on the output of the XML engine. We see two basic alternatives for how such a query processor can operate: event-based and tree-based. In the first case, the query processor immediately checks for each event, e.g., the start of an element, whether it contributes to a query result or invalidates current partial results. In the second case, the query processor generates an explicit transient representation of the document structure and evaluates the query set-at-a-time. This work evaluates these approaches and some optimizations in quantitative terms. Our main results are as follows. The event-based evaluation scheme is approximately 10% faster, even with all the optimizations from this article. The overhead of the query processors is small compared to the running times of the XML engine. Finally, exploiting DTD information in this particular context does not lead to better performance.
  • Keywords
    digital libraries; hypermedia markup languages; query processing; software architecture; trees (mathematics); XML documents; XML engine; event-based evaluation; explicit transient representation; performance; tree-based evaluation; Cost accounting; Data models; Database languages; Engines; Heart; Information systems; Software libraries; Terminology; XML
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Advances in Digital Libraries, 2000. Proceedings. IEEE
  • Conference_Location
    Washington, DC
  • Print_ISBN
    0-7695-0659-3
  • Type

    conf

  • DOI
    10.1109/ADL.2000.848376
  • Filename
    848376
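
The two evaluation schemes named in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes a deliberately simple query ("text of every `child` element directly under a `parent` element"), uses Python's standard `xml.sax` and `xml.etree.ElementTree` modules as stand-ins for the XML engine, and the names `PathHandler`, `event_based`, `tree_based`, and `DOC` are hypothetical.

```python
import xml.sax
import xml.etree.ElementTree as ET

# Hypothetical sample document, used only for illustration.
DOC = ("<lib><book><title>A</title></book>"
       "<journal><title>B</title></journal>"
       "<book><title>C</title></book></lib>")


class PathHandler(xml.sax.ContentHandler):
    """Event-based: test every parser event against the query as it arrives."""

    def __init__(self, parent, child):
        super().__init__()
        self.parent, self.child = parent, child
        self.stack = []     # names of the currently open elements
        self.buffer = None  # text of the current partial result, if any
        self.depth = None   # stack depth at which the partial result started
        self.results = []

    def startElement(self, name, attrs):
        # A start-of-element event may open a new partial result.
        if (self.buffer is None and name == self.child
                and self.stack and self.stack[-1] == self.parent):
            self.buffer = []
            self.depth = len(self.stack)
        self.stack.append(name)

    def characters(self, content):
        if self.buffer is not None:
            self.buffer.append(content)

    def endElement(self, name):
        self.stack.pop()
        # The matching end-of-element event completes the partial result.
        if self.buffer is not None and len(self.stack) == self.depth:
            self.results.append("".join(self.buffer))
            self.buffer = None


def event_based(xml_text, parent, child):
    handler = PathHandler(parent, child)
    xml.sax.parseString(xml_text.encode("utf-8"), handler)
    return handler.results


def tree_based(xml_text, parent, child):
    """Tree-based: build a transient tree first, then evaluate over it."""
    root = ET.fromstring(xml_text)
    return ["".join(t.itertext())
            for p in root.iter(parent)
            for t in p.findall(child)]


if __name__ == "__main__":
    print(event_based(DOC, "book", "title"))
    print(tree_based(DOC, "book", "title"))
```

Both functions return the same titles for the sample document. The difference is where the work happens: the event-based version keeps only a stack and one partial result while the parser runs, whereas the tree-based version materializes the whole document before the query is evaluated.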