• DocumentCode
    3600046
  • Title

    Efficient Parallel Community Detection in Large Edge-Intensive Networks

  • Author

    Guangliang Gao ; Zhan Bu ; Zhiang Wu ; Yuan Li ; Jie Cao

  • Author_Institution
    Coll. of Comput. Sci. & Eng., Nanjing Univ. of Sci. & Technol., Nanjing, China
  • fYear
    2014
  • Firstpage
    253
  • Lastpage
    260
  • Abstract
    Community detection is a classic and very difficult task in social network analysis. A large number of methods have been developed for both efficient and effective community detection. However, much of the existing methods are heavily dependent on the number of links in the network, and thus they often suffer from the computational inefficiency when meeting large edge-intensive networks. In this paper, we present a novel SIMPLifying and Ensembling (SIMPLE) framework for parallel community detection. It employs the random link sampling to simplify the network and obtain basic partitionings on every sampled graphs. Then, the K-means-based Consensus Clustering is used to ensemble a number of basic partitionings to get high-quality community structures. Meanwhile, steps of random sampling and sampled graph partitioning are encapsulated into MapReduce to further improve the efficiency. Experiments on four real-world social networks analyze key parameters and factors inside SIMPLE, and demonstrate the effectiveness of the SIMPLE.
  • Keywords
    complex networks; graph theory; information analysis; network theory (graphs); parallel programming; pattern clustering; sampling methods; social networking (online); MapReduce; SIMPLE framework; k-means-based consensus clustering; large edge-intensive networks; parallel community detection; random sampling; sampled graph partitioning; simplifying and ensembling framework; social network analysis; Approximation algorithms; Clustering algorithms; Communities; Image edge detection; Partitioning algorithms; Twitter; Social Network; Community Detection; Random;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Cloud and Big Data (CBD), 2014 Second International Conference on
  • Print_ISBN
    978-1-4799-8086-4
  • Type

    conf

  • DOI
    10.1109/CBD.2014.52
  • Filename
    7176102