• DocumentCode
    2492823
  • Title

    Effective Clustering of Dense and Concentrated Online Communities

  • Author

    Hai, Phan Nhat ; Shin, Hyoseop

  • Author_Institution
    Dept. of Adv. Technol. Fusion, Konkuk Univ., Seoul, South Korea
  • fYear
    2010
  • fDate
    6-8 April 2010
  • Firstpage
    133
  • Lastpage
    139
  • Abstract
    Most clustering algorithms tend to separate large scale online communities into several meaningful sub-communities by extracting cut points and cut edges. However, these algorithms are not effective on dense and concentrated graphs which do not have any meaningful cut points. Common problems with the previous algorithms are as follows. First, the size of the first cluster is too large as it may contain many incompatible users. Second, the quality and the purity of the clusters are very low. Third, only the dominant first cluster is found to be meaningful. To address these problems, we first propose a graph transformation to separate large scale online communities into two different types of meaningful subgraphs. The first subgraph is the intimacy graph and the second is the reputation graph. Then, we present the effective algorithms for discovering good sub-communities and for excluding incompatible users in these subgraphs. The experimental results show that our algorithms allow for extracting more suitable and meaningful sub-communities than the previous work in dense online networks.
  • Keywords
    graph theory; pattern clustering; social networking (online); cut edge extraction; cut point extraction; graph transformation; intimacy graph; online community; reputation graph; Clustering algorithms; IP networks; Internet; Large-scale systems; Noise generators; Satellites; Social network services; Turning; Social network; dense community; effective clustering; intimacy community; reputation community;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Conference (APWEB), 2010 12th International Asia-Pacific
  • Conference_Location
    Busan
  • Print_ISBN
    978-1-7695-4012-2
  • Electronic_ISBN
    978-1-4244-6600-9
  • Type

    conf

  • DOI
    10.1109/APWeb.2010.73
  • Filename
    5474145