DocumentCode
3234251
Title
Filtering contents by efficient evaluation of XPath expressions
Author
Yuriyama, Madoka ; Nakamura, Hiroaki
Author_Institution
Tokyo Res. Lab., IBM Japan Ltd., Japan
fYear
2003
fDate
27-31 Jan. 2003
Firstpage
148
Lastpage
155
Abstract
The need for selection of Contents based on personal data is increasing. Filtering systems using XML documents as users´ profiles and XPath as contents´ conditions are developed. They need efficient algorithms for evaluating a large number of XPath. However existing algorithms have limitations in that (1) they cannot handle the full XPath language because their execution models are event-based, (2) they require time proportional to the number of XPath expressions, and (3) they do not allow us to add or remove XPath expressions incrementally. We present algorithms that solve all of these problems. The main idea of the algorithms is (1) to construct a data structure that represents multiple XPath expressions in order to evaluate shared subexpressions only once and (2) to repeatedly apply a conventional XPath processor to the data structure. We show that our algorithm is efficient by evaluating the performance of a prototype implementation.
Keywords
hypermedia markup languages; information filters; user modelling; XML documents; XPath expressions; XPath language; execution models; personal data; Automata; Data structures; Filtering; Information retrieval; Internet; Laboratories; Pattern matching; Prototypes; Writing; XML;
fLanguage
English
Publisher
ieee
Conference_Titel
Applications and the Internet, 2003. Proceedings. 2003 Symposium on
Print_ISBN
0-7695-1872-9
Type
conf
DOI
10.1109/SAINT.2003.1183043
Filename
1183043
Link To Document