DocumentCode
2123935
Title
Deriving Semantic Sessions from Semantic Clusters
Author
Safarkhani, Banafsheh ; Talabeigi, Mojde ; Mohsenzadeh, Mehran ; Meybodi, Mohammad Reza
Author_Institution
Dept. of Comput. Eng., Islamic Azad Univ., Tehran
fYear
2009
fDate
3-5 April 2009
Firstpage
523
Lastpage
528
Abstract
A important phase in any Web personalization system is transaction identification. Recently a number of researches have been done to incorporate semantics of a website in representation of transactions. Building a hierarchy of concepts manually is time consuming and expensive. In this paper we intend to address these shortcomings. Our contribution is that we introduce a mechanism to automatically improve the representation of the user in the Website using a comprehensive lexical semantic resource and semantic clusters. We utilize Wikipedia, the largest encyclopedia to date, as a rich lexical resource to enhance the automatic construction of vector model representation of user sessions. We cluster Web pages based on their content with hierarchical unsupervised fuzzy clustering algorithms ,are effective methods, for exploring the structure of complex real data where grouping of overlapping and vague elements is necessary. Entries in Web server logs are used to identify users and visit sessions, while Web page or resources in the site are clustered based on their content and their semantic. Theses clusters of Web documents are used to scrutinize the discovered web sessions in order to identify what we call sub-sessions. Each subsession have consistent goal. This process engendered to improving deriving semantic sessions from Web site user page views. Our experiments show that proposed system significantly improves the quality of Web personalization process.
Keywords
Web sites; document handling; fuzzy set theory; pattern clustering; personal computing; user interfaces; Web documents; Web pages; Web personalization system; Web server logs; Web site; Wikipedia; hierarchical unsupervised fuzzy clustering algorithms; semantic clusters; semantic resource; semantic sessions; transaction identification; Clustering algorithms; Data mining; Encyclopedias; Feature extraction; Information management; Ontologies; Taxonomy; Web pages; Web server; Wikipedia; Semantic cluster; Semantic sub-session; Semantic vectors; Wikipedia;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Management and Engineering, 2009. ICIME '09. International Conference on
Conference_Location
Kuala Lumpur
Print_ISBN
978-0-7695-3595-1
Type
conf
DOI
10.1109/ICIME.2009.131
Filename
5077090
Link To Document