• DocumentCode
    2123935
  • Title

    Deriving Semantic Sessions from Semantic Clusters

  • Author

    Safarkhani, Banafsheh ; Talabeigi, Mojde ; Mohsenzadeh, Mehran ; Meybodi, Mohammad Reza

  • Author_Institution
    Dept. of Comput. Eng., Islamic Azad Univ., Tehran
  • fYear
    2009
  • fDate
    3-5 April 2009
  • Firstpage
    523
  • Lastpage
    528
  • Abstract
    A important phase in any Web personalization system is transaction identification. Recently a number of researches have been done to incorporate semantics of a website in representation of transactions. Building a hierarchy of concepts manually is time consuming and expensive. In this paper we intend to address these shortcomings. Our contribution is that we introduce a mechanism to automatically improve the representation of the user in the Website using a comprehensive lexical semantic resource and semantic clusters. We utilize Wikipedia, the largest encyclopedia to date, as a rich lexical resource to enhance the automatic construction of vector model representation of user sessions. We cluster Web pages based on their content with hierarchical unsupervised fuzzy clustering algorithms ,are effective methods, for exploring the structure of complex real data where grouping of overlapping and vague elements is necessary. Entries in Web server logs are used to identify users and visit sessions, while Web page or resources in the site are clustered based on their content and their semantic. Theses clusters of Web documents are used to scrutinize the discovered web sessions in order to identify what we call sub-sessions. Each subsession have consistent goal. This process engendered to improving deriving semantic sessions from Web site user page views. Our experiments show that proposed system significantly improves the quality of Web personalization process.
  • Keywords
    Web sites; document handling; fuzzy set theory; pattern clustering; personal computing; user interfaces; Web documents; Web pages; Web personalization system; Web server logs; Web site; Wikipedia; hierarchical unsupervised fuzzy clustering algorithms; semantic clusters; semantic resource; semantic sessions; transaction identification; Clustering algorithms; Data mining; Encyclopedias; Feature extraction; Information management; Ontologies; Taxonomy; Web pages; Web server; Wikipedia; Semantic cluster; Semantic sub-session; Semantic vectors; Wikipedia;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Management and Engineering, 2009. ICIME '09. International Conference on
  • Conference_Location
    Kuala Lumpur
  • Print_ISBN
    978-0-7695-3595-1
  • Type

    conf

  • DOI
    10.1109/ICIME.2009.131
  • Filename
    5077090