• DocumentCode
    3745816
  • Title

    Range-Based Clustering Supporting Similarity Search in Big Data

  • Author

    Trong Nhan Phan; J?ger; Nadschl?ger; K?ng

  • Author_Institution
    Inst. for Applic. Oriented Knowledge Process., Johannes Kepler Univ., Linz, Austria
  • fYear
    2015
  • Firstpage
    120
  • Lastpage
    124
  • Abstract
    Thanks to state-of-the-art technologies, we have more and more modern infrastructures as well as automatic processes supporting the agricultural domain. Data collected from parcels by these systems and remote sensors for further analysis result in facing the three main challenges which are known as big volume, big variety, and big velocity, in the era of big data. In terms of similarity search, we propose a range-based clustering method that finds objects which are the most similar compared to the given object in a large-scale computing with Map Reduce. The proposed method groups objects into different clusters which are considered as pivots to perform pre-checking before computing similarity. Furthermore, we conduct some basic experiments to evaluate the performance of the proposed method and observe the influences of the clusters in similarity search.
  • Keywords
    "Big data","Clustering algorithms","Clustering methods","Agriculture","Programming","Search problems","Algorithm design and analysis"
  • Publisher
    ieee
  • Conference_Titel
    Database and Expert Systems Applications (DEXA), 2015 26th International Workshop on
  • ISSN
    1529-4188
  • Print_ISBN
    978-1-4673-7581-8
  • Electronic_ISBN
    2378-3915
  • Type

    conf

  • DOI
    10.1109/DEXA.2015.41
  • Filename
    7406280