Title :
On the architecture of a big data classification tool based on a map reduce approach for hyperspectral image analysis
Author :
V. A. Ayma;R. S. Ferreira;P. N. Happ;D. A. B. Oliveira;G. A. O. P. Costa;R. Q. Feitosa;A. Plaza;P. Gamba
Author_Institution :
Dept. of Electrical Engineering, Pontifical Catholic University of Rio de Janeiro, Brazil
fDate :
7/1/2015 12:00:00 AM
Abstract :
Advances in remote sensors are providing exceptional quantities of large-scale data with increasing spatial, spectral and temporal resolutions, raising new challenges in its analysis, e.g. those presents in classification processes. This work presents the architecture of the InterIMAGE Cloud Platform (ICP): Data Mining Package; a tool able to perform supervised classification procedures on huge amounts of data, on a distributed infrastructure. The architecture is implemented on top of the MapReduce framework. The tool has four classification algorithms implemented taken from WEKA´s machine learning library, namely: Decision Trees, Naïve Bayes, Random Forest and Support Vector Machines. The SVM classifier was applied on datasets of different sizes (2 GB, 4 GB and 10 GB) for different cluster configurations (5, 10, 20, 50 nodes). The results show the tool as a potential approach to parallelize classification processes on big data.
Keywords :
"Big data","Iterative closest point algorithm","Data mining","Computer architecture","Cloud computing","Classification algorithms","Training"
Conference_Titel :
Geoscience and Remote Sensing Symposium (IGARSS), 2015 IEEE International
Electronic_ISBN :
2153-7003
DOI :
10.1109/IGARSS.2015.7326066