Title : 
Web user profiling using hierarchical clustering with improved similarity measure
         
        
            Author : 
Algiriyage, Nilani ; Jayasena, Sanath ; Dias, Gihan
         
        
            Author_Institution : 
Dept. of Comput. Sci. & Eng., Univ. of Moratuwa, Moratuwa, Sri Lanka
         
        
        
        
        
        
            Abstract : 
Web user profiling targets grouping users in to clusters with similar interests. Web sites are attracted by many visitors and gaining insight to the patterns of access leaves lot of information. Web server access log files record every single request processed by web site visitors. Applying web usage mining techniques allow to identify interesting patterns. In this paper we have improved the similarity measure proposed by Velásquez et al. [1] and used it as the distance measure in an agglomerative hierarchical clustering for a data set from an online banking web site. To generate profiles, frequent item set mining is applied over the clusters. Our results show that proper visitor clustering can be achieved with the improved similarity measure.
         
        
            Keywords : 
Internet; Web sites; data mining; pattern clustering; Web server access; Web usage mining techniques; Web user profiling; hierarchical clustering; online banking Web site; Data mining; Mathematical model; Navigation; Time measurement; Web pages; Web servers;
         
        
        
        
            Conference_Titel : 
Moratuwa Engineering Research Conference (MERCon), 2015
         
        
            Conference_Location : 
Moratuwa
         
        
        
            DOI : 
10.1109/MERCon.2015.7112362