• DocumentCode
    162162
  • Title

    Identifying user clicks based on dependency graph

  • Author

    Jun Liu ; Cheng Fang ; Ansari, Nayeem

  • Author_Institution
    Sch. of Inf. & Commun. Eng., Beijing Univ. of Posts & Telecommun., Beijing, China
  • fYear
    2014
  • fDate
    9-10 May 2014
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Identifying user clicks from a large number of measured HTTP requests is the fundamental task for web usage mining, which is important for web administrators and developers. Nowadays, the prevalent parallel web browsing behavior caused by multi-tab web browsers renders accurate user click identification from massive requests a great challenge. In this paper, we propose a dependency graph model to describe the complicated web browsing behavior. Based on this model, we develop two algorithms to establish the dependency graph for measured requests, and identify user clicks by comparing their probabilities of being primary requests with a self-learned threshold. We evaluate our method with a large dataset collected from a real world mobile core network. The experimental results show that our method can achieve high accurate user clicks identification.
  • Keywords
    Internet; data mining; graph theory; hypermedia; parallel processing; transport protocols; HTTP; Web administrators; Web usage mining; dependency graph; mobile core network; multitab Web browsers; parallel Web browsing; self learned threshold; user click identification; Accuracy; Browsers; Cleaning; Data mining; Mathematical model; Mobile communication; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Wireless and Optical Communication Conference (WOCC), 2014 23rd
  • Conference_Location
    Newark, NJ
  • Type

    conf

  • DOI
    10.1109/WOCC.2014.6839915
  • Filename
    6839915