DocumentCode :
3745816
Title :
Range-Based Clustering Supporting Similarity Search in Big Data
Author :
Trong Nhan Phan; J?ger; Nadschl?ger; K?ng
Author_Institution :
Inst. for Applic. Oriented Knowledge Process., Johannes Kepler Univ., Linz, Austria
fYear :
2015
Firstpage :
120
Lastpage :
124
Abstract :
Thanks to state-of-the-art technologies, we have more and more modern infrastructures as well as automatic processes supporting the agricultural domain. Data collected from parcels by these systems and remote sensors for further analysis result in facing the three main challenges which are known as big volume, big variety, and big velocity, in the era of big data. In terms of similarity search, we propose a range-based clustering method that finds objects which are the most similar compared to the given object in a large-scale computing with Map Reduce. The proposed method groups objects into different clusters which are considered as pivots to perform pre-checking before computing similarity. Furthermore, we conduct some basic experiments to evaluate the performance of the proposed method and observe the influences of the clusters in similarity search.
Keywords :
"Big data","Clustering algorithms","Clustering methods","Agriculture","Programming","Search problems","Algorithm design and analysis"
Publisher :
ieee
Conference_Titel :
Database and Expert Systems Applications (DEXA), 2015 26th International Workshop on
ISSN :
1529-4188
Print_ISBN :
978-1-4673-7581-8
Electronic_ISBN :
2378-3915
Type :
conf
DOI :
10.1109/DEXA.2015.41
Filename :
7406280
Link To Document :
بازگشت