DocumentCode
2226599
Title
Matrix dimensionality reduction for mining Web logs
Author
Lu, Jianjiang ; Xu, Baowen ; Yang, Hongji
Author_Institution
Dept. of Comput. Sci. & Eng., Southeast Univ., Nanjing, China
fYear
2003
fDate
13-17 Oct. 2003
Firstpage
405
Lastpage
408
Abstract
Web-based logs contain potentially useful data with which designers can assess the usability and effectiveness of their choices. Clustering techniques have been used to automatically discover typical user profiles from Web access logs recently. But it is a challenging problem to design effective similarity measure between the session vectors, which are usually high dimensional and sparse. Nonnegative matrix factorisation approaches are applied to dimensionality reduction of the session-URL matrix, and the spherical k-means algorithm is used to partition the projecting vectors of the user session vectors into several clusters. Two methods for discovering typical user session profiles from the clusters are presented last. The results of experiment show that our algorithms can mine interesting user profiles effectively.
Keywords
Web sites; customer profiles; data mining; matrix decomposition; pattern clustering; vectors; Clustering techniques; Web log mining; Web sites; matrix dimensionality reduction; nonnegative matrix factorization; session-URL matrix; spherical k-means algorithm; user session profiles; user session vectors partitioning; Clustering algorithms; Computer science; Data mining; Educational technology; Laboratories; Partitioning algorithms; Programmable logic arrays; Sparse matrices; Uniform resource locators; Web server;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence, 2003. WI 2003. Proceedings. IEEE/WIC International Conference on
Print_ISBN
0-7695-1932-6
Type
conf
DOI
10.1109/WI.2003.1241222
Filename
1241222
Link To Document