Title :
MyBSP: An Iterative Processing Framework Based on the Cloud Platform for Graph Data
Author :
Chao Liu ; Hong Yao ; Deze Zeng ; Qingzhong Liang ; Chengyu Hu ; Xuesong Yan
Author_Institution :
Sch. of Comput. Sci. & Technol., Huazhong Univ. of Sci. & Technol., Wuhan, China
Abstract :
Massive cloud-based data-intensive applications (e.g., iterative MapReduce-based ones) may involve graph data processing. How to analyze and process large-scale graph data effectively remains a challenging open problem. We present a parallel computation framework named MyBSP, which is inspired by Google's Pregel system. MyBSP implements the Bulk Synchronous Parallel (BSP) programming model and introduces a parallel execution unit module to achieve iterative processing, which avoids the cost of restarting computation jobs and therefore reduces I/O overhead (e.g., network communication and disk access). Furthermore, we implement a MyBSP-based PageRank algorithm. Experiments are conducted to evaluate the performance and scalability of our MyBSP prototype system and to compare it with the MapReduce model. The experimental results show that MyBSP achieves a speedup of about 3.5X over MapReduce on the small-size graph dataset, and that it outperforms MapReduce by a factor of 2.1 when processing the large-scale dataset.
Keywords :
cloud computing; data analysis; information retrieval; iterative methods; parallel programming; BSP programming model; Google Pregel system; MyBSP framework; MyBSP-based PageRank algorithm; bulk synchronous parallel programming model; cloud platform; graph data analysis; graph data processing; iterative processing framework; large-scale dataset; massive cloud-based data-intensive applications; parallel computation framework; parallel execution; small-size graph dataset; Algorithm design and analysis; Computational modeling; Data models; Data processing; Process control; Servers; Synchronization; BSP model
Conference_Title :
Advanced Cloud and Big Data (CBD), 2014 Second International Conference on
Print_ISBN :
978-1-4799-8086-4
DOI :
10.1109/CBD.2014.23