DocumentCode
3165541
Title
Finding Cohesive Clusters for Analyzing Knowledge Communities
Author
Kandylas, Vasileios ; Upham, S. Phineas ; Ungar, Lyle H.
Author_Institution
Univ. of Pennsylvania, Philadelphia
fYear
2007
fDate
28-31 Oct. 2007
Firstpage
203
Lastpage
212
Abstract
Documents and authors can be clustered into "knowledge communities" based on the overlap in the papers they cite. We introduce a new clustering algorithm, Streemer, which finds cohesive foreground clusters embedded in a diffuse background, and use it to identify knowledge communities as foreground clusters of papers which share common citations. To analyze the evolution of these communities over time, we build predictive models with features based on the citation structure, the vocabulary of the papers, and the affiliations and prestige of the authors. Findings include that scientific knowledge communities tend to grow more rapidly if their publications build on diverse information and if they use a narrow vocabulary.
Keywords
data mining; information networks; text analysis; Streemer; clustering algorithm; cohesive foreground clusters; predictive models; scientific knowledge communities; text mining; Citation analysis; Clustering algorithms; Communities; Computational Intelligence Society; Data mining; Predictive models; Rhetoric; Text mining; Time measurement; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2007. ICDM 2007. Seventh IEEE International Conference on
Conference_Location
Omaha, NE
ISSN
1550-4786
Print_ISBN
978-0-7695-3018-5
Type
conf
DOI
10.1109/ICDM.2007.22
Filename
4470244
Link To Document