Title :
Heavyweight Pattern Mining in Attributed Flow Graphs
Author :
Gomes, Carolina Simoes ; Amaral, Jose Nelson ; Sander, Joerg ; Siu, Joran ; Li Ding
Author_Institution :
Intuit Canada Edmonton, Edmonton, AB, Canada
Abstract :
This paper defines a new problem - heavyweight pattern mining in attributed flow graphs. The problem can be described as the discovery of patterns in flow graphs that have sets of attributes associated with their nodes. A connection between nodes is represented as a directed edge. The amount of load that goes through a path between nodes, or the frequency of transmission of such load between nodes, is represented as edge weights. A heavyweight pattern is a sub-set of attributes, found in a dataset of attributed flow graphs, that are connected by edges and have a computed weight higher than an user-defined threshold. A new algorithm called AFG Miner is introduced, the first one to our knowledge that finds heavyweight patterns in a dataset of attributed flow graphs and associates each pattern with its occurrences. The paper also describes a new tool for compiler engineers, HEP Miner, that applies the AFG Miner algorithm to Profile-based Program Analysis modeled as a heavyweight pattern mining problem.
Keywords :
data flow graphs; data mining; directed graphs; AFG Miner algorithm; HEP Miner; attributed flow graphs; compiler engineers; directed edge; heavyweight pattern mining; profile-based program analysis; user-defined threshold; Algorithm design and analysis; Complexity theory; Data mining; Electronic mail; Flow graphs; Labeling; Servers; data mining; flow graphs; graph mining; pattern mining; program analysis; program profiling; software analysis; sub-graph mining;
Conference_Titel :
Data Mining (ICDM), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4799-4303-6
DOI :
10.1109/ICDM.2014.51