DocumentCode :
588704
Title :
A Label-Based Partitioning Strategy for Mining Link Patterns
Author :
Cuifang Zhao ; Xiang Zhang ; Peng Wang
Author_Institution :
Sch. of Comput. Sci. & Eng., Southeast Univ., Nanjing, China
fYear :
2012
fDate :
8-10 Nov. 2012
Firstpage :
203
Lastpage :
206
Abstract :
As the explosive growth of online linked data, the task of mining link patterns attracts more and more attention. A practical issue is how to perform mining efficiently in large-scale linked data. Existing pattern mining algorithms usually assume that the dataset can fit into the main memory, while linked data of billion triples is far beyond the memory limitation. In this paper we give a pilot study of a novel partitioning strategy for mining link patterns in large-scale linked data. First, we propose an algorithm named Par Group to divide and group large linked data to partitions based on vertex label, Second, an adapted gSpan is applied for mining link patterns in each partition, At last, discovered link patterns are merged into a global result set. Experiments show that our strategy is feasible and promising in some scenarios.
Keywords :
data mining; graph theory; merging; pattern clustering; ParGroup algorithm; gSpan; label-based partitioning strategy; large linked data group; large-scale linked data mining; link pattern mining; online linked data; pattern merging; pattern mining algorithms; vertex label; Algorithm design and analysis; Data mining; Databases; Educational institutions; Merging; Partitioning algorithms; Resource description framework; Graph Clustering; Graph Partition; Pattern Mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Knowledge, Information and Creativity Support Systems (KICSS), 2012 Seventh International Conference on
Conference_Location :
Melbourne, VIC
Print_ISBN :
978-1-4673-4564-4
Type :
conf
DOI :
10.1109/KICSS.2012.15
Filename :
6405530
Link To Document :
بازگشت