• DocumentCode
    3697228
  • Title

    BRDPHHC: A Balance RDF Data Partitioning Algorithm Based on Hybrid Hierarchical Clustering

  • Author

    Yonglin Leng;Zhikui Chen;Fangming Zhong;Hua Zhong

  • Author_Institution
    Sch. of Software Technol., Dalian Univ. of Technol., Dalian, China
  • fYear
    2015
  • Firstpage
    1755
  • Lastpage
    1760
  • Abstract
    Data partitioning is a fundamental step to achieve effective storage and query of RDF big data. This paper presents a balance RDF data partitioning algorithm based on hybrid hierarchical clustering (BRDPHHC), which combines AP and K-means clustering. BRDPHHC´s functionality includes three aspects: (i) a pre-processing step combining nodes compression and nodes remove to reduce the scale of raw data points, (ii) AP clustering algorithm is used to coarsen the RDF graph step by step and produce data blocks, and (iii) K-means algorithm is used for data partitioning finally. Experiments on benchmark datasets demonstrate the effectiveness of the proposed scheme.
  • Keywords
    "Resource description framework","Clustering algorithms","Partitioning algorithms","Algorithm design and analysis","Complexity theory","Sparse matrices","Distributed databases"
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing and Communications (HPCC), 2015 IEEE 7th International Symposium on Cyberspace Safety and Security (CSS), 2015 IEEE 12th International Conferen on Embedded Software and Systems (ICESS), 2015 IEEE 17th International Conference on
  • Type

    conf

  • DOI
    10.1109/HPCC-CSS-ICESS.2015.190
  • Filename
    7336425