DocumentCode
2371003
Title
Combining the Web content and usage mining to understand the visitor behavior in a Web site
Author
Velásquez, Juan ; Yasuda, Hiroshi ; Aoki, Terumasa
Author_Institution
Res. Center for Adv. Sci. & Technol., Tokyo Univ., Japan
fYear
2003
fDate
19-22 Nov. 2003
Firstpage
669
Lastpage
672
Abstract
A Web site is a semi structured collection of different kinds of data, whose motivation is to show relevant information to a visitor and in this way capture her/his attention. Understanding the specific preferences that define the visitor behavior in a Web site is a complex task. An approximation is supposed that depends on the content, navigation sequence and time spent in each page visited. These variables can be extracted from the Web log files and the Web site itself, using Web usage and content mining respectively. Combining the described variables, a similarity measure among visitor sessions is introduced and used in a clustering algorithm, which identifies groups of similar sessions, allowing the analysis of visitor behavior. In order to prove the methodology´s effectiveness, it was applied in a certain Web site, showing the benefits of the described approach.
Keywords
Web sites; content management; data mining; information retrieval; self-organising feature maps; Web content; Web log file; Web site; Web usage; clustering algorithm; content mining; navigation sequence; visitor behavior analysis; Algorithm design and analysis; Clustering algorithms; Data mining; Electronic mail; Internet; Navigation; Time measurement; Web mining; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN
0-7695-1978-4
Type
conf
DOI
10.1109/ICDM.2003.1251004
Filename
1251004
Link To Document