DocumentCode :
3690243
Title :
On the architecture of a big data classification tool based on a map reduce approach for hyperspectral image analysis
Author :
V. A. Ayma;R. S. Ferreira;P. N. Happ;D. A. B. Oliveira;G. A. O. P. Costa;R. Q. Feitosa;A. Plaza;P. Gamba
Author_Institution :
Dept. of Electrical Engineering, Pontifical Catholic University of Rio de Janeiro, Brazil
fYear :
2015
fDate :
7/1/2015 12:00:00 AM
Firstpage :
1508
Lastpage :
1511
Abstract :
Advances in remote sensors are providing exceptional quantities of large-scale data with increasing spatial, spectral and temporal resolutions, raising new challenges in its analysis, e.g. those presents in classification processes. This work presents the architecture of the InterIMAGE Cloud Platform (ICP): Data Mining Package; a tool able to perform supervised classification procedures on huge amounts of data, on a distributed infrastructure. The architecture is implemented on top of the MapReduce framework. The tool has four classification algorithms implemented taken from WEKA´s machine learning library, namely: Decision Trees, Naïve Bayes, Random Forest and Support Vector Machines. The SVM classifier was applied on datasets of different sizes (2 GB, 4 GB and 10 GB) for different cluster configurations (5, 10, 20, 50 nodes). The results show the tool as a potential approach to parallelize classification processes on big data.
Keywords :
"Big data","Iterative closest point algorithm","Data mining","Computer architecture","Cloud computing","Classification algorithms","Training"
Publisher :
ieee
Conference_Titel :
Geoscience and Remote Sensing Symposium (IGARSS), 2015 IEEE International
ISSN :
2153-6996
Electronic_ISBN :
2153-7003
Type :
conf
DOI :
10.1109/IGARSS.2015.7326066
Filename :
7326066
Link To Document :
بازگشت