Title :
Clustering algorithm in literature-based discovery
Author :
Ye, Chunlei ; Leng, Fuhai ; Guo, Xin
Author_Institution :
Nat. Sci. of Libr., Chinese Acad. of Sci., Beijing, China
Abstract :
Literature-based discovery is linking two or more literature concepts that have heretofore not been linked (i.e., disjoint), in order to produce novel, interesting, plausible, and intelligible knowledge. Cluster analysis is the core of literature-based discovery. This paper proposes an improved fuzzy c means (FCM) algorithm based on the analysis of existing clustering analysis of literature-based discovery. The new FCM algorithm mainly focus on the fuzzy degree of membership and make the FCM algorithm achieve better clustering results in despite of the existence of isolated points or low-frequency terms. And because of the relaxation of the normalization condition, the final clustering result is not very sensitive to the number of the pre-determined clusters. The new FCM algorithm takes the low-frequency terms into full account, and reduces the impaction of the pre-determined number of clusters on the final clustering.
Keywords :
fuzzy set theory; pattern clustering; FCM algorithm; cluster analysis; clustering algorithm; fuzzy c means; intelligible knowledge; literature-based discovery; Algorithm design and analysis; Clustering algorithms; Databases; Filtering; Marine animals; Petroleum; Unified modeling language; Fuzzy c means algorithm; Information retrieval; Literature-based discovery; NFCM; Raynaud´s Phenomenon; Subordinative degree;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2010 Seventh International Conference on
Conference_Location :
Yantai, Shandong
Print_ISBN :
978-1-4244-5931-5
DOI :
10.1109/FSKD.2010.5569366