• DocumentCode
    3234251
  • Title

    Filtering contents by efficient evaluation of XPath expressions

  • Author

    Yuriyama, Madoka ; Nakamura, Hiroaki

  • Author_Institution
    Tokyo Res. Lab., IBM Japan Ltd., Japan
  • fYear
    2003
  • fDate
    27-31 Jan. 2003
  • Firstpage
    148
  • Lastpage
    155
  • Abstract
    The need for selection of Contents based on personal data is increasing. Filtering systems using XML documents as users´ profiles and XPath as contents´ conditions are developed. They need efficient algorithms for evaluating a large number of XPath. However existing algorithms have limitations in that (1) they cannot handle the full XPath language because their execution models are event-based, (2) they require time proportional to the number of XPath expressions, and (3) they do not allow us to add or remove XPath expressions incrementally. We present algorithms that solve all of these problems. The main idea of the algorithms is (1) to construct a data structure that represents multiple XPath expressions in order to evaluate shared subexpressions only once and (2) to repeatedly apply a conventional XPath processor to the data structure. We show that our algorithm is efficient by evaluating the performance of a prototype implementation.
  • Keywords
    hypermedia markup languages; information filters; user modelling; XML documents; XPath expressions; XPath language; execution models; personal data; Automata; Data structures; Filtering; Information retrieval; Internet; Laboratories; Pattern matching; Prototypes; Writing; XML;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Applications and the Internet, 2003. Proceedings. 2003 Symposium on
  • Print_ISBN
    0-7695-1872-9
  • Type

    conf

  • DOI
    10.1109/SAINT.2003.1183043
  • Filename
    1183043