DocumentCode
3401834
Title
Identifying semantically meaningful sub-communities within Twitter blogosphere
Author
Sotiropoulos, D.N. ; Kounavis, Chris D. ; Giaglis, George M.
Author_Institution
Dept. of Manage. Sci. & Technol., Athens Univ. of Econ. & Bus., Athens, Greece
fYear
2013
fDate
10-12 July 2013
Firstpage
1
Lastpage
8
Abstract
This paper addresses the problem of semantically meaningful group detection within a sub-community of twitter micro-bloggers by utilizing a topic modeling, multi-objective clustering approach. The proposed group detection method is anchored on the Latent Dirichlet Allocation (LDA) topic modeling technique, aiming at identifying clusters of twitter users that are optimal in terms of both spatial and topical compactness. Specifically, the group detection problem is formulated as a multi-objective optimization problem taking into consideration two complementary cluster formation directives. The first objective, related to spatial compactness, is achieved by minimizing the overall deviation from the corresponding cluster centers. The second, related to topical compactness, is achieved by minimizing the portion of probability mass assigned to low probability topics for the corresponding cluster centroids. In our approach, optimization is performed by employing a multi-objective genetic algorithm, which results in a variety of cluster structures that are significantly more interpretable than cluster assignments obtained with traditional single-objective clustering algorithms.
Keywords
genetic algorithms; pattern clustering; social networking (online); LDA topic modeling technique; Twitter blogosphere; Twitter microbloggers; cluster assignments; cluster centers; cluster centroids; complementary cluster formation directives; latent Dirichlet allocation; low probability topics; multiobjective clustering; multiobjective genetic algorithm; multiobjective optimization problem; probability mass; semantically meaningful group detection; semantically meaningful sub-communities identification; spatial compactness; topical compactness; Communities; Context; Optimization; Probability distribution; Semantics; Social network services; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Information, Intelligence, Systems and Applications (IISA), 2013 Fourth International Conference on
Conference_Location
Piraeus
Print_ISBN
978-1-4799-0770-0
Type
conf
DOI
10.1109/IISA.2013.6623727
Filename
6623727
Link To Document