Title :
Efficient large graph pattern mining for big data in the cloud
Author :
Chun-Chieh Chen ; Kuan-Wei Lee ; Chih-Chieh Chang ; De-Nian Yang ; Ming-Syan Chen
Author_Institution :
Grad. Inst. of Networking & Multimedia, Nat. Taiwan Univ., Taipei, Taiwan
Abstract :
Mining big graph data is an important problem in the graph mining research area. Although cloud computing is effective at solving traditional algorithm problems, mining frequent patterns of a massive graph with cloud computing still faces the three challenges: 1) the graph partition problem, 2) asymmetry of information, and 3) pattern-preservation merging. Therefore, this paper presents a new approach, the cloud-based SpiderMine (c-SpiderMine), which exploits cloud computing to process the mining of large patterns on big graph data. The proposed method addresses the above issues for implementing a big graph data mining algorithm in the cloud. We conduct the experiments with three real data sets, and the experimental results demonstrate that c-SpiderMine can significantly reduce execution time with high scalability in dealing with big data in the cloud.
Keywords :
Big Data; cloud computing; data mining; graph theory; merging; Big graph data mining; c-SpiderMine; cloud computing; cloud-based SpiderMine; graph partition problem; information asymmetry; large graph pattern mining; pattern-preservation merging; Cloud computing; Data mining; Information management; Merging; Partitioning algorithms; Scalability; Big data; Cloud computing; Graph pattern mining;
Conference_Titel :
Big Data, 2013 IEEE International Conference on
Conference_Location :
Silicon Valley, CA
DOI :
10.1109/BigData.2013.6691618