• DocumentCode
    3401834
  • Title

    Identifying semantically meaningful sub-communities within Twitter blogosphere

  • Author

    Sotiropoulos, D.N. ; Kounavis, Chris D. ; Giaglis, George M.

  • Author_Institution
    Dept. of Manage. Sci. & Technol., Athens Univ. of Econ. & Bus., Athens, Greece
  • fYear
    2013
  • fDate
    10-12 July 2013
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    This paper addresses the problem of semantically meaningful group detection within a sub-community of twitter micro-bloggers by utilizing a topic modeling, multi-objective clustering approach. The proposed group detection method is anchored on the Latent Dirichlet Allocation (LDA) topic modeling technique, aiming at identifying clusters of twitter users that are optimal in terms of both spatial and topical compactness. Specifically, the group detection problem is formulated as a multi-objective optimization problem taking into consideration two complementary cluster formation directives. The first objective, related to spatial compactness, is achieved by minimizing the overall deviation from the corresponding cluster centers. The second, related to topical compactness, is achieved by minimizing the portion of probability mass assigned to low probability topics for the corresponding cluster centroids. In our approach, optimization is performed by employing a multi-objective genetic algorithm, which results in a variety of cluster structures that are significantly more interpretable than cluster assignments obtained with traditional single-objective clustering algorithms.
  • Keywords
    genetic algorithms; pattern clustering; social networking (online); LDA topic modeling technique; Twitter blogosphere; Twitter microbloggers; cluster assignments; cluster centers; cluster centroids; complementary cluster formation directives; latent Dirichlet allocation; low probability topics; multiobjective clustering; multiobjective genetic algorithm; multiobjective optimization problem; probability mass; semantically meaningful group detection; semantically meaningful sub-communities identification; spatial compactness; topical compactness; Communities; Context; Optimization; Probability distribution; Semantics; Social network services; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information, Intelligence, Systems and Applications (IISA), 2013 Fourth International Conference on
  • Conference_Location
    Piraeus
  • Print_ISBN
    978-1-4799-0770-0
  • Type

    conf

  • DOI
    10.1109/IISA.2013.6623727
  • Filename
    6623727