DocumentCode :
3607807
Title :
Aggregation on the fly: reducing traffic for big data in the cloud
Author :
Huan Ke ; Peng Li ; Song Guo ; Stojmenovic, Ivan
Author_Institution :
Univ. of Aizu, Aizu, Japan
Volume :
29
Issue :
5
fYear :
2015
Firstpage :
17
Lastpage :
23
Abstract :
As a leading framework for processing and analyzing big data, MapReduce is leveraged by many enterprises to parallelize their data processing on distributed computing systems. Unfortunately, the all-to-all data forwarding from map tasks to reduce tasks in the traditional MapReduce framework would generate a large amount of network traffic. The fact that the intermediate data generated by map tasks can be combined with significant traffic reduction in many applications motivates us to propose a data aggregation scheme for MapReduce jobs in cloud. Specifically, we design an aggregation architecture under the existing MapReduce framework with the objective of minimizing the data traffic during the shuffle phase, in which aggregators can reside anywhere in the cloud. Some experimental results also show that our proposal outperforms existing work by reducing the network traffic significantly.
Keywords :
Big Data; cloud computing; data analysis; parallel processing; Big Data analysis; MapReduce; cloud computing; data aggregation scheme; data traffic minimization; distributed computing system; Bandwidth; Big data; Cloud computing; Distributed processing; Network security; Telecommunication traffic; Virtual machining;
fLanguage :
English
Journal_Title :
Network, IEEE
Publisher :
ieee
ISSN :
0890-8044
Type :
jour
DOI :
10.1109/MNET.2015.7293300
Filename :
7293300
Link To Document :
بازگشت