• DocumentCode
    27884
  • Title

    ClubCF: A Clustering-Based Collaborative Filtering Approach for Big Data Application

  • Author

    Rong Hu ; Wanchun Dou ; Jianxun Liu

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Nanjing Univ., Nanjing, China
  • Volume
    2
  • Issue
    3
  • fYear
    2014
  • fDate
    Sept. 2014
  • Firstpage
    302
  • Lastpage
    313
  • Abstract
    Spurred by service computing and cloud computing, an increasing number of services are emerging on the Internet. As a result, service-relevant data become too big to be effectively processed by traditional approaches. In view of this challenge, a clustering-based collaborative filtering approach is proposed in this paper, which aims at recruiting similar services in the same clusters to recommend services collaboratively. Technically, this approach is enacted around two stages. In the first stage, the available services are divided into small-scale clusters, in logic, for further processing. At the second stage, a collaborative filtering algorithm is imposed on one of the clusters. Since the number of the services in a cluster is much less than the total number of the services available on the web, it is expected to reduce the online execution time of collaborative filtering. At last, several experiments are conducted to verify the availability of the approach, on a real data set of 6225 mashup services collected from ProgrammableWeb.
  • Keywords
    Big Data; cloud computing; collaborative filtering; pattern clustering; service-oriented architecture; ClubCF; Internet; ProgrammableWeb; big data application; cloud computing; clustering-based collaborative filtering approach; mashup services; online execution time; service computing; service-relevant data; small-scale clusters; Cloud computing; Clustering algorithms; Data handling; Data storage systems; Filtering; Information management; Mashups; Big data application; cluster; collaborative filtering; mashup;
  • fLanguage
    English
  • Journal_Title
    Emerging Topics in Computing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2168-6750
  • Type

    jour

  • DOI
    10.1109/TETC.2014.2310485
  • Filename
    6763038