DocumentCode
3575046
Title
Optimizing the Topologies of Virtual Networks for Cloud-Based Big Data Processing
Author
Cong Xu ; Jiahai Yang ; Hui Yu ; Haizhuo Lin ; Hui Zhang
Author_Institution
Inst. for Network Sci. & Cyberspace, Tsinghua Univ., Beijing, China
fYear
2014
Firstpage
189
Lastpage
196
Abstract
Cloud-based big data platforms are being widely adopted in industry, due to their advantages of facilitating the implementation of big data processing and enabling elastic service framework. Alongside with the widespread adoption of cloud-based MapReduce frameworks, a series of solutions have been proposed to improve the performance of big data services over cloud. Majorities of the existing studies concentrate on optimizing the task scheduling or resource provisioning mechanisms to improve the platform´s data processing or communication performance separately, without an overall consideration of both the performance factors. Moreover, these studies seldom consider the impact of virtual network topologies on the performance of MapReduce workflows. The purpose of this work is to optimize the topologies of virtual networks used in cloud-based MapReduce frameworks. We formulate both data transmission and data processing overhead of a specific cloud-based big data application, describe the optimal deployment of virtual networks as an optimization problem and then design algorithms to solve this problem. Experimental results show that our topology optimization mechanism improves the overall performance of cloud-based big data applications dramatically.
Keywords
Big Data; cloud computing; data communication; optimisation; parallel processing; resource allocation; scheduling; telecommunication network topology; virtual machines; virtual private networks; big data services; cloud-based MapReduce; cloud-based big data processing; data processing; data transmission; elastic service framework; optimal virtual network deployment; resource provisioning mechanism; task scheduling optimization; virtual network topology optimization; Big data; Data communication; Network topology; Optimization; Servers; Topology; MapReduce; OpenStack Neutron; cloud computing; optimal deployment; virtual networks;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing and Communications, 2014 IEEE 6th Intl Symp on Cyberspace Safety and Security, 2014 IEEE 11th Intl Conf on Embedded Software and Syst (HPCC,CSS,ICESS), 2014 IEEE Intl Conf on
Print_ISBN
978-1-4799-6122-1
Type
conf
DOI
10.1109/HPCC.2014.36
Filename
7056739
Link To Document