Title :
A Framework for Cloud-Based Large-Scale Data Analytics and Visualization: Case Study on Multiscale Climate Data
Author :
Lu, Sifei ; Li, Reuben Mingguang ; Tjhi, William Chandra ; Lee, Kee Khoon ; Wang, Long ; Li, Xiarong ; Ma, Di
Author_Institution :
Inst. of High Performance Comput., Agency for Sci., Technol. & Res., Singapore, Singapore
fDate :
Nov. 29 2011-Dec. 1 2011
Abstract :
In this paper, we present a cloud framework to provide cloud clustering, workflow scheduling and management, fault tolerance and distributed data storage, data analytics and visualisation services. Using a practical case study, we show that in the process of analyzing multiscale climate data, typical problems plaguing data analysts are faced. These include large datasets and limited computational resources, data complexity and limited knowledge, and varying data structures/formats and the need to integrate different tools. The implementation of our framework to climate studies was a success. This can be seen in its ability to perform spatio-temporal data analysis and visualization of a large multi-dimensional climate dataset with reduced processing time. The framework demonstrates great flexibility and simplicity for end users intending to perform data analysis by aiding the integration of data and tools and enabling interactive visualization on-the-fly. This is coupled with effective utilization of computational resources and data storage systems.
Keywords :
cloud computing; computational complexity; data analysis; data integration; data visualisation; fault tolerance; geographic information systems; spatiotemporal phenomena; cloud clustering; cloud-based large-scale data analytics; computational resource; data complexity; data integration; distributed data storage; fault tolerance; multiscale climate dataset visualization; spatio-temporal data analysis; workflow management; workflow scheduling; Cloud computing; Data mining; Data visualization; Engines; Meteorology; Processor scheduling; Trajectory; climate data analysis; cloud computing; multiscale data; spatio-temporal visualization; workflow scheduling;
Conference_Titel :
Cloud Computing Technology and Science (CloudCom), 2011 IEEE Third International Conference on
Conference_Location :
Athens
Print_ISBN :
978-1-4673-0090-2
DOI :
10.1109/CloudCom.2011.95