Title of article
Development of Multiple Big Data Analytics Platforms with Rapid Response
Author/Authors
Chang, Bao Rong Department of Computer Science and Information Engineering - National University of Kaohsiung - 700 Kaohsiung University Rd., Nanzih District, Taiwan , Lee, Yun-Da Department of Computer Science and Information Engineering - National University of Kaohsiung - 700 Kaohsiung University Rd., Nanzih District, Taiwan , Liao, Po-Hao Department of Computer Science and Information Engineering - National University of Kaohsiung - 700 Kaohsiung University Rd., Nanzih District, Taiwan
Pages
14
From page
1
To page
14
Abstract
The crucial problem of the integration of multiple platforms is how to adapt for their own computing features so as to execute the assignments most efficiently and gain the best outcome. This paper introduced the new approaches to big data platform, RHhadoop and SparkR, and integrated them to form a high-performance big data analytics with multiple platforms as part of business intelligence (BI) to carry out rapid data retrieval and analytics with R programming. This paper aims to develop the optimization for job scheduling using MSHEFT algorithm and implement the optimized platform selection based on computing features for improving the system throughput significantly. In addition, users would simply give R commands rather than run Java or Scala program to perform the data retrieval and analytics in the proposed platforms. As a result, according to performance index calculated for various methods, although the optimized platform selection can reduce the execution time for the data retrieval and analytics significantly, furthermore scheduling optimization definitely increases the system efficiency a lot.
Keywords
Data Analytics , Development , Rapid Response , Analytics Platforms
Journal title
Scientific Programming
Serial Year
2017
Full Text URL
Record number
2608075
Link To Document