DocumentCode :
2765805
Title :
Mining user access patterns based on Web logs
Author :
Liu, Xiangwei ; He, Pilian ; Yang, Qian
Author_Institution :
Dept. of Comput. Sci., Tianjin Univ.
fYear :
2005
fDate :
1-4 May 2005
Firstpage :
2280
Lastpage :
2283
Abstract :
In this paper, different from usual order, not directly use the maximal forward reference path to mine sequence patterns but use DBSCAN algorithm to cluster the Web pages that have been accessed by users. Then, decide the Web page class that each page belongs to based on heuristic rules. Next, cluster the users who have the same interest in one or some kinds of Web pages. One user can belong to several classes, because the user may be interested in different types of Web pages. Finally, based on theory of sequence patterns mining, mine out user access patterns in each class by GSP algorithm. The benefit of using cluster methods is to find out layers´ or classes´ relationships from data even without any layer information of data. In this way, the user access patterns can be found more precisely
Keywords :
Internet; data mining; DBSCAN algorithm; GSP algorithm; Web logs; Web pages; heuristic rules; maximal forward reference path; sequence patterns mining; user access patterns mining; Clustering algorithms; Computer science; Data mining; Helium; Itemsets; Pattern analysis; Pattern recognition; Spatial databases; Web mining; Web pages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical and Computer Engineering, 2005. Canadian Conference on
Conference_Location :
Saskatoon, Sask.
ISSN :
0840-7789
Print_ISBN :
0-7803-8885-2
Type :
conf
DOI :
10.1109/CCECE.2005.1557444
Filename :
1557444
Link To Document :
بازگشت