DocumentCode :
2225254
Title :
Weighted Instance Based Learner (WIBL) for user profiling
Author :
Cufoglu, Ayse ; Lohi, Mahi ; Everiss, Colin
Author_Institution :
Sch. of Electron. & Comput. Sci., Univ. of Westminster, London, UK
fYear :
2012
fDate :
26-28 Jan. 2012
Firstpage :
201
Lastpage :
205
Abstract :
With an increase in web-based products and services, user profiling has created opportunities for both businesses and other organizations to provide a channel for user awareness as well as to achieve high user satisfaction. Apart from traditional collaborative and content-based methods, a number of classification and clustering algorithms have been used for user profiling. Instance Based Learner (IBL) classifier is a comprehensive form of the Nearest Neighbour (NN) algorithm and it is suitable for user profiling as users with similar profiles are likely to share similar personal interests and preferences. In IBL every attribute has an equal effect on the classification regardless of their relevance. In this paper, we proposed a weighted classification method, namely Weighted Instance Based Learner (WIBL), to build and handle user profiles. With WIBL, we introduce Per Category Feature (PCF) method to IBL in order to distinguish the effect of attributes on classification. PCF is an attribute weighting method and it assigns weights to attributes using conditional probabilities. The direct use of this method with IBL is not possible. Hence, two possible solutions were also proposed to address this problem. This study is aimed to test the performance of WIBL for user profiling. To validate the performance of WIBL, a series of computer simulations were carried out. These simulations were conducted using a large user profile database that includes 5000 training and 1000 test instances (users). Here, each user is represented with three sets of profile information; demographic, interest and preference data. The results illustrate that WIBL with PCF methods performs better than IBL on user profiling by reducing the error up to 28% on the selected dataset.
Keywords :
Internet; learning (artificial intelligence); pattern classification; pattern clustering; probability; user modelling; IBL classifier; NN algorithm; PCF method; WIBL; Web-based products; Web-based services; attribute weighting method; classification algorithms; clustering algorithms; computer simulations; conditional probabilities; demographic data; instance based learner classifier; interest data; nearest neighbour algorithm; per category feature method; preference data; profile information; user awareness; user profiling; user satisfaction; weight assignment; weighted instance based learner; Bayesian methods; Clustering algorithms; Collaboration; Educational institutions; Error analysis; Machine learning; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Applied Machine Intelligence and Informatics (SAMI), 2012 IEEE 10th International Symposium on
Conference_Location :
Herl´any
Print_ISBN :
978-1-4577-0196-2
Type :
conf
DOI :
10.1109/SAMI.2012.6208957
Filename :
6208957
Link To Document :
بازگشت