Title :
HadoopWatch: A first step towards comprehensive traffic forecasting in cloud computing
Author :
Yang Peng ; Kai Chen ; Guohui Wang ; Wei Bai ; Zhiqiang Ma ; Lin Gu
Author_Institution :
Hong Kong Univ. of Sci. & Technol., Hong Kong, China
fDate :
April 27 2014-May 2 2014
Abstract :
This paper presents our effort towards comprehensive traffic forecasting for big data applications using external, light-weighted file system monitoring. Our idea is motivated by the key observations that rich traffic demand information already exists in the log and meta-data files of many big data applications, and that such information can be readily extracted through run-time file system monitoring. As the first step, we use Hadoop as a concrete example to explore our methodology and develop a system called HadoopWatch to predict traffic demand of Hadoop applications. We further implement HadoopWatch in our real small-scale testbed with 10 physical servers and 30 virtual machines. Our experiments over a series of MapReduce applications demonstrate that HadoopWatch can forecast the traffic demand with almost 100% accuracy and time advance. Furthermore, it makes no modification of the Hadoop framework, and introduces little overhead to the application performance.
Keywords :
Big Data; cloud computing; meta data; parallel programming; public domain software; telecommunication traffic; virtual machines; Big Data applications; HadoopWatch; MapReduce applications; cloud computing; comprehensive traffic demand forecasting; external-light-weighted file system monitoring; information extraction; log files; meta-data files; physical servers; real-small-scale testbed; traffic demand prediction; virtual machines; Big data; Computers; Conferences; Forecasting; Monitoring; Pipelines; Writing;
Conference_Titel :
INFOCOM, 2014 Proceedings IEEE
Conference_Location :
Toronto, ON
DOI :
10.1109/INFOCOM.2014.6847920