Title :
MMap: Fast billion-scale graph computation on a PC via memory mapping
Author :
Zhiyuan Lin ; Minsuk Kahng ; Sabrin, Kaeser Md ; Chau, Duen Horng Polo ; Ho Lee ; Kang, U.
Author_Institution :
Georgia Tech, Atlanta, GA, USA
Abstract :
Graph computation approaches such as GraphChi and TurboGraph recently demonstrated that a single PC can perform efficient computation on billion-node graphs. To achieve high speed and scalability, they often need sophisticated data structures and memory management strategies. We propose a minimalist approach that forgoes such requirements, by leveraging the fundamental memory mapping (MMap) capability found on operating systems. We contribute: (1) a new insight that MMap is a viable technique for creating fast and scalable graph algorithms that surpasses some of the best techniques; (2) the design and implementation of popular graph algorithms for billion-scale graphs with little code, thanks to memory mapping; (3) extensive experiments on real graphs, including the 6.6 billion edge Yahoo Web graph, and show that this new approach is significantly faster or comparable to the highly-optimized methods (e.g., 9.5X faster than GraphChi for computing PageRank on 1.47B edge Twitter graph). We believe our work provides a new direction in the design and development of scalable algorithms. Our packaged code is available at http://poloclub.gatech.edu/mmap/.
Keywords :
Internet; data structures; graph theory; graphs; operating system kernels; storage management; GraphChi; MMap; PC; PageRank; TurboGraph; Yahoo Web graph; billion-node graphs; billion-scale graph computation; billion-scale graphs; memory management strategies; memory mapping; operating systems; packaged code; scalable graph algorithms; sophisticated data structures; Algorithm design and analysis; Data structures; Memory management; Random access memory; Runtime; Twitter; Vectors;
Conference_Titel :
Big Data (Big Data), 2014 IEEE International Conference on
Conference_Location :
Washington, DC
DOI :
10.1109/BigData.2014.7004226