DocumentCode :
719074
Title :
Clustering analysis for large scale data sets
Author :
Singh, Sachin ; Mishra, Ashish
Author_Institution :
Electr. Eng., Indian Inst. of Technol. Roorkee, Roorkee, India
fYear :
2015
fDate :
15-16 May 2015
Firstpage :
1
Lastpage :
4
Abstract :
The real-world big data can be clustered along desired dimensions but it is limited in its applicability to large-scale problems due to its high computational complexity, user´s desire, number of dimensions etc. Recently, many approaches have been proposed to accelerate the large scale data clustering. Unfortunately, these methods usually sacrifice quite a lot of information of the original data; incompetent to produce multiple clustering etc and don´t consider the geometrical, psychological and physiological interpretation of naturally occurring data whose representation may be parts-based in human brain. In this paper seven clustering algorithms are analyzed which is based on large scale data of seven different environment and dimensions to find out a universal framework for the representation and processing of knowledge. Our empirical study shows the encouraging results of the LSC-K algorithm in comparisons to state-of-the-art algorithms.
Keywords :
Big Data; computational complexity; pattern clustering; LSC-K algorithm; clustering analysis; computational complexity; large scale data sets; real-world Big Data; Accuracy; Algorithm design and analysis; Automation; Clustering algorithms; Data mining; Geometry; Mutual information; clustring; data mining; large scale data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computing, Communication & Automation (ICCCA), 2015 International Conference on
Conference_Location :
Noida
Print_ISBN :
978-1-4799-8889-1
Type :
conf
DOI :
10.1109/CCAA.2015.7148353
Filename :
7148353
Link To Document :
بازگشت