مرکز منطقه ای اطلاع رساني علوم و فناوري - A maximum entropy approach for collaborative filtering

DocumentCode :

3372858

Title :

A maximum entropy approach for collaborative filtering

Author :

Browning, John ; Miller, David J.

Author_Institution :

Dept. of Electr. Eng., Pennsylvania State Univ., University Park, PA, USA

fYear :

2001

fDate :

2001

Firstpage :

Lastpage :

Abstract :

Collaborative filtering (CF) involves predicting the preferences of a user for a set of items given partial knowledge of the user´s preferences for other items, while leveraging a database of profiles for other users. CF has applications e.g. in predicting Web sites a person will visit and in automated searching of document databases. Fundamentally, CF is a pattern recognition task, but a formidable one, often involving a huge feature space, a large data set, and many missing features. Even more daunting is the fact that a CF inference engine must be capable of predicting any (user-selected) items, given any available set of partial knowledge on the user´s other preferences. In other words, the model must be designed to solve any of a huge (combinatoric) set of possible inference tasks. CF techniques include memory-based, classification-based, and statistical modelling approaches. Among these, modelling approaches scale best with large data sets and are the most adept at handling missing features. The disadvantage of these methods lies in the statistical assumptions (e.g. feature independence), which may be unjustified. To address this shortcoming, we propose a new model-based CF method, based on the maximum entropy principle. For the MS Web application, the new method is demonstrated to outperform a number of CF approaches, including naive Bayes and latent variable (cluster) models, support vector machines (SVMs), and the (Pearson) correlation method

Keywords :

data mining; filtering theory; inference mechanisms; information resources; maximum entropy methods; query processing; statistical analysis; CF inference engine; CF techniques; MS Web application; Pearson correlation method; SVMs; Web site visit prediction; automated searching; collaborative filtering; database of profiles; document databases; feature independence; feature space; large data set; large data sets; latent variable models; maximum entropy approach; maximum entropy principle; missing features; model-based CF method; naive Bayes; partial knowledge; pattern recognition task; possible inference tasks; statistical assumptions; statistical modelling; support vector machines; user preferences; Collaboration; Combinatorial mathematics; Correlation; Electronic mail; Engines; Entropy; Filtering; Pattern recognition; Spatial databases; Support vector machines;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks for Signal Processing XI, 2001. Proceedings of the 2001 IEEE Signal Processing Society Workshop

Conference_Location :

North Falmouth, MA

ISSN :

1089-3555

Print_ISBN :

0-7803-7196-8

Type :

conf

DOI :

10.1109/NNSP.2001.943105

Filename :

943105

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3372858