Title :
On-line generation of suggestions for Web users
Author :
Silvestri, Fabrizio ; Baraglia, Ranieri ; Palmerini, Paolo ; Serranò, Massimo
Author_Institution :
Inf. Sci. & Technol. Inst., National Res. council, Pisa, Italy
Abstract :
The knowledge extracted from the analysis of historical information of a Web server can be used to develop personalization or recommendation systems. Web usage mining (WUM) systems are specifically designed to carry out this task by analyzing the data representing usage data about a particular Web site. Typically these systems are composed by two parts. One, executed offline, that analyze the server access logs in order to find a suitable categorization, and another executed online which is aimed at classifying the active requests, according to the previous offline analysis. In this paper we propose a WUM recommendation system, implemented as a module of the Apache Web server that is able to dynamically generate suggestions to pages that have not yet been visited by a user and might be of his potential interest. Differently from previously proposed WUM systems, SUGGEST 2.0 incrementally builds and maintains the historical information, without the need for an offline component, by means of an incremental graph partitioning algorithm. In the last part, we also analyze the quality of the suggestions generated and the performance of the module implemented. To this purpose we introduce also a new quality metric, which try to estimate the effectiveness of a recommendation system as the capacity of anticipating users´ requests that will be made farther in the future.
Keywords :
Web sites; data mining; information filters; information management; information retrieval; Apache Web server; SUGGEST 2.0; WUM recommendation system; Web site; Web usage mining; Web users; active requests classification; historical information analysis; incremental graph partitioning algorithm; offline analysis; offline execution; online execution; online suggestion generation; personalization systems; quality metric; server access logs analysis; usage data analysis; users requests; Councils; Data analysis; Data mining; Delta modulation; File servers; Information analysis; Information science; Partitioning algorithms; Web page design; Web server;
Conference_Titel :
Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004. International Conference on
Print_ISBN :
0-7695-2108-8
DOI :
10.1109/ITCC.2004.1286486