Title :
Listing Dense Subgraphs in Small Memory
Author :
Pinto, Patricio ; Cruces, Nataly ; Hernandez, C.
Author_Institution :
Dept. of Comput. Sci., Univ. of Concepcion, Concepcion, Chile
Abstract :
Listing relevant patterns in graphs is becoming increasingly challenging as Web and social graphs grow rapidly in size. This scenario requires processing information more efficiently, including processing data that cannot fit in main memory. Typical approaches for processing data with limited main memory include the streaming and external memory models. This paper addresses the problem of listing dense subgraphs in Web and social graphs using little memory. We propose an external memory algorithm based on K-way merge-sort for clustering and reordering input graphs. We also propose mining heuristics that work well with different stream orders, such as URL, BFS, and cluster-based orders. Our experimental evaluation shows that on Web graphs, compared with the in-memory algorithm, the streaming mining heuristic finds between 70% and 96% of the edges participating in dense subgraphs, uses only between 17% and 25% of the memory, and takes between 34% and 65% of the running time. We further consider an application that uses these dense subgraphs to compress Web graphs with a representation that supports querying the collection of subgraphs for pattern recovery and basic statistics without decompression.
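The abstract mentions an external memory algorithm based on K-way merge-sort. The sketch below illustrates only the generic K-way merge-sort pattern on an edge list, not the authors' clustering and reordering algorithm, whose details are not given here. The function names (`write_run`, `read_run`, `external_sort_edges`), the text file format, and the `chunk_size` parameter are assumptions made for illustration.

```python
import heapq
import os
import tempfile


def write_run(edges, directory):
    """Sort one in-memory chunk of edges and spill it to disk as a run file."""
    fd, path = tempfile.mkstemp(dir=directory, suffix=".run", text=True)
    with os.fdopen(fd, "w") as f:
        for u, v in sorted(edges):
            f.write(f"{u} {v}\n")
    return path


def read_run(path):
    """Stream the edges of one sorted run back from disk, one line at a time."""
    with open(path) as f:
        for line in f:
            u, v = line.split()
            yield int(u), int(v)


def external_sort_edges(edge_stream, chunk_size, directory):
    """Yield edges in sorted order while holding at most chunk_size in memory.

    Hypothetical helper: phase 1 writes sorted runs, phase 2 merges all K runs
    at once with a heap (heapq.merge keeps one pending edge per run).
    """
    runs, chunk = [], []
    for edge in edge_stream:
        chunk.append(edge)
        if len(chunk) >= chunk_size:
            runs.append(write_run(chunk, directory))
            chunk = []
    if chunk:
        runs.append(write_run(chunk, directory))
    yield from heapq.merge(*(read_run(p) for p in runs))


if __name__ == "__main__":
    sample = [(3, 1), (1, 2), (2, 3), (1, 1), (3, 0)]
    with tempfile.TemporaryDirectory() as tmp:
        for edge in external_sort_edges(iter(sample), chunk_size=2, directory=tmp):
            print(edge)
```

Sorting by (source, target) is only one possible key. A reordering step such as the one the abstract describes would presumably sort by some cluster-derived key instead, which is an assumption about how this pattern might be adapted.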
Keywords :
Internet; data mining; graph theory; merging; pattern clustering; sorting; statistics; BFS; URL; Web graphs; cluster-based order; clustering; dense subgraph listing; external memory algorithm; input graph reordering; k-way merge-sort; pattern recovery; processing data; relevant pattern listing; small memory; social graphs; statistics; streaming mining heuristic; Clustering algorithms; Data models; Heuristic algorithms; Memory management; Partitioning algorithms; Social network services; Vegetation; External Memory Algorithms; Graph Pattern Listing; Streaming Algorithms; Web Graphs;
Conference_Title :
Web Congress (LA-WEB), 2014 9th Latin American
Conference_Location :
Ouro Preto
Print_ISBN :
978-1-4799-6952-4
DOI :
10.1109/LAWeb.2014.16