• DocumentCode
    3750397
  • Title

    Topic model based behaviour modeling and clustering analysis for wireless network users

  • Author

    Bingjie Leng;Jingchu Liu;Huimin Pan;Sheng Zhou;Zhisheng Niu Tsinghua

  • Author_Institution
    National Laboratory for Information Science and Technology, Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
  • fYear
    2015
  • Firstpage
    410
  • Lastpage
    415
  • Abstract
    User behaviour analysis based on traffic log in wireless networks can be beneficial to many fields in real life: not only for commercial purposes, but also for improving network service quality and social management. We cluster users into groups marked by the most frequently visited websites to find their preferences. In this paper, we propose a user behaviour model based on Topic Model from document classification problems. We use the logarithmic TF-IDF (term frequency - inverse document frequency) weighing to form a high-dimensional sparse feature matrix. Then we apply LSA (Latent semantic analysis) to deduce the latent topic distribution and generate a low-dimensional dense feature matrix. K-means++, which is a classic clustering algorithm, is then applied to the dense feature matrix and several interpretable user clusters are found. Moreover, by combining the clustering results with additional demographical information, including age, gender, and financial information, we are able to uncover more realistic implications from the clustering results.
  • Keywords
    "Clustering algorithms","Internet","Analytical models","Algorithm design and analysis","Sparse matrices","Mobile communication","Wireless networks"
  • Publisher
    ieee
  • Conference_Titel
    Communications (APCC), 2015 21st Asia-Pacific Conference on
  • Type

    conf

  • DOI
    10.1109/APCC.2015.7412547
  • Filename
    7412547