DocumentCode :
2579481
Title :
Implementation of Space Optimized Bisecting K-Means (BKM) Based on Hadoop
Author :
Yanshen Yin ; Chengguang Wei ; Guigang Zhang ; Chao Li
Author_Institution :
RIIT, TNLIST & CS, Tsinghua Univ., Beijing, China
fYear :
2012
fDate :
16-18 Nov. 2012
Firstpage :
170
Lastpage :
175
Abstract :
This article arises from a study of the factual basis of the co-authorship phenomenon in scientific fields. Studying massive amounts of relational data is of major theoretical and practical significance for retrieving specialized academic information and for understanding academic development trends across various fields. Such studies involve the problem of clustering the coauthors that appear in the data. However, existing clustering software and algorithms can hardly meet the need to cluster massive amounts of data, so finding an approach to this problem is very important. To solve it, this article, after analyzing the current state of related work, presents an optimized Bisecting K-Means (BKM) clustering algorithm based on Hadoop and describes in detail how the algorithm is optimized and the key points of its implementation. An experimental estimate of the algorithm's complexity indicates the current problems and the direction for future study.
Keywords :
computational complexity; distributed processing; optimisation; pattern clustering; relational databases; BKM; Hadoop; academic development trend; algorithm complexity; coauthors phenomenon factual basis; data cluttering; optimized bisecting k-means clustering algorithm; professionally academic information; relational data; space optimized bisecting k-means; Algorithm design and analysis; Binary trees; Clustering algorithms; Complexity theory; Distributed databases; Indexes; Software algorithms; BKM; Bisecting K-Means; Hadoop; algorithm; clustering; mapreduce;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Web Information Systems and Applications Conference (WISA), 2012 Ninth
Conference_Location :
Haikou
Print_ISBN :
978-1-4673-3054-1
Type :
conf
DOI :
10.1109/WISA.2012.47
Filename :
6385205