• DocumentCode
    2343750
  • Title

    Building a Distributed Generic Recommender Using Scalable Data Mining Library

  • Author

    Bhatia, Lavannya ; Prasad, S.S.

  • Author_Institution
    Dept. of CSE, JSS Acad. of Tech. Educ., Noida, India
  • fYear
    2015
  • fDate
    13-14 Feb. 2015
  • Firstpage
    98
  • Lastpage
    102
  • Abstract
    Recommender systems produce list of recommended items through content based or collaborative or hybrid combination of these two approaches. The paper presents a generic approach for performing collaborative filtering using data mining techniques to discover relationships among users and items. Using generic model techniques a single recommender system can produce recommendations about a variety of items. The methodologies reported for development of recommender systems are not efficient for generic application. The difference in the implementations of recommender depends upon how they analyze the big input data to recognize the similarity between users and items that indicates the relevant preferences for that user. Generic user based recommender works with data model encapsulating recommender input data in Apache Mahout which is extensible data mining library. The recommender system framework can use any similarity metric. We have chosen Pearson correlation because the computation would be fast. One of the parameters of user based recommender is User-Neighborhood. Fixed-size Neighborhood has the advantage that the recommendations are based on fewer similar users. Hadoop software library allows distributed processing of big data across multiple clusters of nodes. The paper describes an implementation using Apache Mahout and Hadoop and also explores feasible augmentation that can enhance efficiency of recommendation. This paper reports successful implementation of generic recommender.
  • Keywords
    Big Data; collaborative filtering; data mining; data models; parallel processing; recommender systems; software libraries; Apache Mahout; Hadoop software library; Pearson correlation; big data; collaborative filtering; data model; distributed generic recommender; distributed processing; fixed-size neighborhood; generic user-based recommender; recommender system framework; scalable data mining library; user-neighborhood; Collaboration; Conferences; Correlation; Data mining; Data models; Recommender systems; Big data; Hadoop; Mahout; collaborative filtering; data mining; generic recommender; machine learning; recommender systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence & Communication Technology (CICT), 2015 IEEE International Conference on
  • Conference_Location
    Ghaziabad
  • Print_ISBN
    978-1-4799-6022-4
  • Type

    conf

  • DOI
    10.1109/CICT.2015.129
  • Filename
    7078675