Title :
Optimal approximation algorithm of virtual machine placement for data latency minimization in cloud systems
Author :
Jian-Jhih Kuo ; Hsiu-Hsien Yang ; Ming-Jer Tsai
Author_Institution :
Dept. of Comput. Sci., Nat. Tsing Hua Univ., Hsinchu, Taiwan
fDate :
April 27 2014-May 2 2014
Abstract :
The MapReduce/Hadoop architecture has become very important and effective in cloud systems because many data-intensive applications are usually required to process big data. In such environments, big data is partitioned and stored over several data nodes; thus, the total completion time of a task would be delayed if the maximum access latency among all pairs of a data node and its assigned computation node is not bounded. Moreover, the computation nodes usually need to communicate with each other for aggregating the computation results; therefore, the maximum access latency among all pairs of assigned computation nodes also needs to be bounded. In the literature, it has been proved that the placement problem of computation nodes (virtual machines) to minimize the maximum access latency among all pairs of a data node and its assigned computation node and among all pairs of assigned computation nodes does not admit any approximation algorithm with a factor smaller than two, whereas no approximation algorithms have been proposed so far. In this paper, we first propose a 3-approximation algorithm for resolving the problem. Subsequently, we close the gap by proposing a 2-approximation algorithm, that is, an optimal approximation algorithm, for resolving the problem in the price of higher time complexity. Finally, we conduct simulations for evaluating the performance of our algorithms.
Keywords :
Big Data; approximation theory; cloud computing; computational complexity; minimisation; virtual machines; 2-approximation algorithm; 3-approximation algorithm; Big Data processing; MapReduce-Hadoop architecture; cloud systems; computation node placement problem; data access latency; data latency minimization; data node; data-intensive applications; maximum access latency minimization; optimal approximation algorithm; time complexity; virtual machine placement; Approximation algorithms; Approximation methods; Big data; Bipartite graph; Conferences; Three-dimensional displays; Virtual machining;
Conference_Titel :
INFOCOM, 2014 Proceedings IEEE
Conference_Location :
Toronto, ON
DOI :
10.1109/INFOCOM.2014.6848063