• DocumentCode
    2390928
  • Title

    A webpage similarity measure for web sessions clustering using sequence alignment

  • Author

    Azimpour-Kivi, Mozhgan ; Azmi, Reza

  • Author_Institution
    Sch. of Eng. & Sci., Sharif Univ. of Technol., Tehran, Iran
  • fYear
    2011
  • fDate
    15-16 June 2011
  • Firstpage
    20
  • Lastpage
    24
  • Abstract
    Web sessions clustering is a process of web usage mining task that aims to group web sessions with similar trends and usage patterns into clusters. This process is crucial for effective website management, web personalization and developing web recommender systems. Accurate clustering of web sessions is highly dependent to the similarity measure defined to compare web sessions. In this paper, we propose a similarity measure for comparing web sessions. The sequential order of web navigations in sessions is considered using sequence alignment method. Furthermore, we propose to consider the usage similarity of two web sessions based on the time a user spends on a webpage, and also the frequency of visit of each page within the session. The proposed method is validated by clustering a collection of web sessions using an agglomerative clustering technique and comparing the results with available methods. The experimental results show effectiveness of the proposed method to capture the properties of web session data.
  • Keywords
    Web sites; data mining; pattern clustering; recommender systems; Web navigation; Web personalization; Web recommender systems; Web session clustering; Web usage mining task; Webpage similarity measure; Website management; sequence alignment; sequence alignment method; Clustering algorithms; Conferences; Couplings; Data mining; Dynamic programming; Time frequency analysis; Web pages; interestingness of webpage; sequence alignment; web sessions clustering; webpage simialrity measure;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Artificial Intelligence and Signal Processing (AISP), 2011 International Symposium on
  • Conference_Location
    Tehran
  • Print_ISBN
    978-1-4244-9833-8
  • Type

    conf

  • DOI
    10.1109/AISP.2011.5960993
  • Filename
    5960993