DocumentCode
162162
Title
Identifying user clicks based on dependency graph
Author
Jun Liu ; Cheng Fang ; Ansari, Nayeem
Author_Institution
Sch. of Inf. & Commun. Eng., Beijing Univ. of Posts & Telecommun., Beijing, China
fYear
2014
fDate
9-10 May 2014
Firstpage
1
Lastpage
5
Abstract
Identifying user clicks from a large number of measured HTTP requests is the fundamental task for web usage mining, which is important for web administrators and developers. Nowadays, the prevalent parallel web browsing behavior caused by multi-tab web browsers renders accurate user click identification from massive requests a great challenge. In this paper, we propose a dependency graph model to describe the complicated web browsing behavior. Based on this model, we develop two algorithms to establish the dependency graph for measured requests, and identify user clicks by comparing their probabilities of being primary requests with a self-learned threshold. We evaluate our method with a large dataset collected from a real world mobile core network. The experimental results show that our method can achieve high accurate user clicks identification.
Keywords
Internet; data mining; graph theory; hypermedia; parallel processing; transport protocols; HTTP; Web administrators; Web usage mining; dependency graph; mobile core network; multitab Web browsers; parallel Web browsing; self learned threshold; user click identification; Accuracy; Browsers; Cleaning; Data mining; Mathematical model; Mobile communication; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Wireless and Optical Communication Conference (WOCC), 2014 23rd
Conference_Location
Newark, NJ
Type
conf
DOI
10.1109/WOCC.2014.6839915
Filename
6839915
Link To Document