DocumentCode
2579481
Title
Implementation of Space Optimized Bisecting K-Means (BKM) Based on Hadoop
Author
Yanshen Yin ; Chengguang Wei ; Guigang Zhang ; Chao Li
Author_Institution
RIIT, TNLIST & CS, Tsinghua Univ., Beijing, China
fYear
2012
fDate
16-18 Nov. 2012
Firstpage
170
Lastpage
175
Abstract
This article arises from a study of the factual basis of the coauthorship phenomenon in scientific fields. Studying massive amounts of relational data is of major theoretical and practical significance for retrieving professional academic information and for understanding the development trends of various fields. Such a study involves the problem of clustering the coauthors that appear in the data. However, existing clustering software and algorithms can hardly meet the needs of clustering analysis on massive amounts of data, so finding an approach to this problem is important. To solve it, this article presents an optimized Bisecting K-Means (BKM) clustering algorithm based on Hadoop and, after analyzing the status quo of related work, describes in detail how the algorithm is optimized and the key points of its implementation. Experiments estimating the complexity of the algorithm indicate the current problems and the direction for future study.
Keywords
computational complexity; distributed processing; optimisation; pattern clustering; relational databases; BKM; Hadoop; academic development trend; algorithm complexity; coauthors phenomenon factual basis; data clustering; optimized bisecting k-means clustering algorithm; professionally academic information; relational data; space optimized bisecting k-means; Algorithm design and analysis; Binary trees; Clustering algorithms; Complexity theory; Distributed databases; Indexes; Software algorithms; BKM; Bisecting K-Means; Hadoop; algorithm; clustering; mapreduce
fLanguage
English
Publisher
IEEE
Conference_Titel
Web Information Systems and Applications Conference (WISA), 2012 Ninth
Conference_Location
Haikou
Print_ISBN
978-1-4673-3054-1
Type
conf
DOI
10.1109/WISA.2012.47
Filename
6385205