Title :
Enhancing Web access using data mining techniques
Author :
Vaisman, Alejandro A. ; Dandretta, Gabriel ; Sapia, Mariela
Author_Institution :
Toronto Univ., Ont., Canada
Abstract :
In this paper we study data mining techniques as tools for reducing the time needed to access Web pages in the environment of a corporation where users connect to the Internet through a proxy server. We add a data mining server to the traditional Web architecture. This server computes, using sequential patterns, the pages likely to be requested by the users, taking into account their Web access history. Then, the server loads these pages into the proxy´s cache in order to have them available when they are actually asked for. We describe our implementation and the results obtained, showing that the average access time to a Web page can be dramatically reduced using this technique. Further, we also implemented our proposal using simple association rules, showing that sequential patterns result in a smaller and more accurate set of rules, as a consequence of taking into account the order of the requests. Finally, we discuss the differences with other prefetching proposals.
Keywords :
Internet; Web sites; cache storage; data mining; information retrieval; network servers; Internet; Web access history; Web architecture; Web page access; World Wide Web; association rules; corporation; data mining; prefetching; proxy cache; proxy server; sequential patterns; Association rules; Computer architecture; Data mining; History; Internet; Prefetching; Proposals; Service oriented architecture; Web pages; Web server;
Conference_Titel :
Database and Expert Systems Applications, 2003. Proceedings. 14th International Workshop on
Print_ISBN :
0-7695-1993-8
DOI :
10.1109/DEXA.2003.1232043