DocumentCode
3189880
Title
Utility-Based Web Path Traversal Pattern Mining
Author
Zhou, Lin ; Liu, Ying ; Wang, Jing ; Shi, Yong
fYear
2007
fDate
28-31 Oct. 2007
Firstpage
373
Lastpage
380
Abstract
Web usage mining is to discover user traversal patterns of Web pages from Weblog records. Usually, a popular Website may register the Weblog records in the order of hundreds of megabytes every day, which provide rich information about the Web dynamics. Path traversal pattern mining discovers frequent sequential Web accessing patterns from Weblog databases. However, it fails to reflect the different impacts of different Web pages to different users. The difference between Web pages makes a strong impact on the decision-makings in Internet information service applications. Therefore, in this paper, we introduce "utility" into path traversal pattern mining problem. Utility is a measure of how "interesting" or "useful" a Web page is. As a result, it allows Web service providers to quantify the user preferences of different traversal paths. Two-Phase utility mining method is used to discover high utility path traversal patterns. We apply our proposed "high utility path traversal mining" algorithm on a real-world Weblog database, and compare the high utility path traversal patterns with the frequent traversal patterns by a traditional path traversal method. We demonstrated the interesting paths, as well as their significance to the decision making process.
Keywords
Conferences; Data mining; Databases; Decision making; Information analysis; Uniform resource locators; Web and internet services; Web pages; Web server; Web services;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining Workshops, 2007. ICDM Workshops 2007. Seventh IEEE International Conference on
Conference_Location
Omaha, NE
Print_ISBN
978-0-7695-3019-2
Electronic_ISBN
978-0-7695-3033-8
Type
conf
DOI
10.1109/ICDMW.2007.72
Filename
4476694
Link To Document