Title :
VENUS: Vertex-centric streamlined graph computation on a single PC
Author :
Jiefeng Cheng ; Qin Liu ; Zhenguo Li ; Wei Fan ; Lui, John C. S. ; Cheng He
Author_Institution :
Huawei Noah´s Ark Lab., Hong Kong, China
Abstract :
Recent studies show that disk-based graph computation on just a single PC can be as highly competitive as cluster-based computing systems on large-scale problems. Inspired by this remarkable progress, we develop VENUS, a disk-based graph computation system which is able to handle billion-scale problems efficiently on a commodity PC. VENUS adopts a novel computing architecture that features vertex-centric “streamlined” processing - the graph is sequentially loaded and the update functions are executed in parallel on the fly. VENUS deliberately avoids loading batch edge data by separating read-only structure data from mutable vertex data on disk. Furthermore, it minimizes random IOs by caching vertex data in main memory. The streamlined processing is realized with efficient sequential scan over massive structure data and fast feeding a large number of update functions. Extensive evaluation on large real-world and synthetic graphs has demonstrated the efficiency of VENUS. For example, VENUS takes just 8 minutes with hard disk for PageRank on the Twitter graph with 1.5 billion edges. In contrast, Spark takes 8.1 minutes with 50 machines and 100 CPUs, and GraphChi takes 13 minutes using fast SSD drive.
Keywords :
graph theory; microprocessor chips; Twitter graph; VENUS; billion-scale problems; cluster based computing systems; disk-based graph computation system; large-scale problems; read-only structure data; single PC; vertex centric streamlined graph computation; vertex-centric streamlined processing; Analytical models; Computational modeling; Load modeling; Memory management; Random access memory; Venus;
Conference_Titel :
Data Engineering (ICDE), 2015 IEEE 31st International Conference on
Conference_Location :
Seoul
DOI :
10.1109/ICDE.2015.7113362