DocumentCode :
3234251
Title :
Filtering contents by efficient evaluation of XPath expressions
Author :
Yuriyama, Madoka ; Nakamura, Hiroaki
Author_Institution :
Tokyo Res. Lab., IBM Japan Ltd., Japan
fYear :
2003
fDate :
27-31 Jan. 2003
Firstpage :
148
Lastpage :
155
Abstract :
The need to select content based on personal data is increasing. Filtering systems have been developed that use XML documents as user profiles and XPath expressions as conditions on content. Such systems need efficient algorithms for evaluating a large number of XPath expressions. However, existing algorithms are limited in that (1) they cannot handle the full XPath language because their execution models are event-based, (2) they require time proportional to the number of XPath expressions, and (3) they do not allow XPath expressions to be added or removed incrementally. We present algorithms that solve all of these problems. The main idea of the algorithms is (1) to construct a data structure that represents multiple XPath expressions so that shared subexpressions are evaluated only once and (2) to repeatedly apply a conventional XPath processor to the data structure. We show that our algorithms are efficient by evaluating the performance of a prototype implementation.
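The idea of evaluating shared subexpressions only once can be illustrated with a minimal sketch. This is not the authors' data structure or algorithm: it assumes simple relative paths whose steps contain no "/" inside predicates, shares only common path prefixes through a trie, and uses Python's `xml.etree.ElementTree` (a limited XPath subset) as the conventional processor. The names `StepNode`, `SharedXPathIndex`, and the sample profile and expression ids are hypothetical.

```python
import xml.etree.ElementTree as ET


class StepNode:
    """One location step, shared by every expression with the same prefix."""

    def __init__(self):
        self.children = {}  # step string -> StepNode
        self.ids = set()    # ids of expressions that end at this step


class SharedXPathIndex:
    """Hypothetical prefix-sharing index over simple relative paths."""

    def __init__(self):
        self.root = StepNode()
        self.evaluated_steps = 0  # steps evaluated by the last matches() call

    def add(self, expr_id, path):
        # Incremental insertion: reuse existing nodes for a shared prefix.
        node = self.root
        for step in path.split("/"):
            node = node.children.setdefault(step, StepNode())
        node.ids.add(expr_id)

    def remove(self, expr_id, path):
        # Incremental removal: drop the id, then prune nodes left empty.
        trail, node = [], self.root
        for step in path.split("/"):
            trail.append((node, step))
            node = node.children[step]
        node.ids.discard(expr_id)
        for parent, step in reversed(trail):
            child = parent.children[step]
            if child.ids or child.children:
                break
            del parent.children[step]

    def matches(self, context):
        """Return the ids of all expressions selecting at least one node."""
        self.evaluated_steps = 0
        matched = set()
        self._walk(self.root, [context], matched)
        return matched

    def _walk(self, node, contexts, matched):
        for step, child in node.children.items():
            # Each shared step is handed to the XPath processor once.
            self.evaluated_steps += 1
            selected = [hit for ctx in contexts for hit in ctx.findall(step)]
            if selected:
                matched |= child.ids
                self._walk(child, selected, matched)


# Assumed sample data: a user profile and four content conditions.
profile = ET.fromstring(
    "<profile><age>30</age><interests>"
    "<interest topic='jazz'/><interest topic='golf'/>"
    "</interests></profile>"
)
index = SharedXPathIndex()
index.add("ad1", "interests/interest[@topic='jazz']")
index.add("ad2", "interests/interest[@topic='golf']")
index.add("ad3", "interests/interest[@topic='chess']")
index.add("ad4", "address/city")
result = index.matches(profile)
print(sorted(result), index.evaluated_steps)
```

In this sketch the `interests` step is evaluated once for three expressions, and a prefix that selects nothing (`address`) cuts off every expression below it. Sharing prefixes only is a deliberate simplification; sharing arbitrary subexpressions, as the abstract describes, would need a richer structure.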
Keywords :
hypermedia markup languages; information filters; user modelling; XML documents; XPath expressions; XPath language; execution models; personal data; Automata; Data structures; Filtering; Information retrieval; Internet; Laboratories; Pattern matching; Prototypes; Writing; XML;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Applications and the Internet, 2003. Proceedings. 2003 Symposium on
Print_ISBN :
0-7695-1872-9
Type :
conf
DOI :
10.1109/SAINT.2003.1183043
Filename :
1183043