• DocumentCode
    245057
  • Title

    Heavyweight Pattern Mining in Attributed Flow Graphs

  • Author

    Gomes, Carolina Simoes ; Amaral, Jose Nelson ; Sander, Joerg ; Siu, Joran ; Li Ding

  • Author_Institution
    Intuit Canada Edmonton, Edmonton, AB, Canada
  • fYear
    2014
  • fDate
    14-17 Dec. 2014
  • Firstpage
    827
  • Lastpage
    832
  • Abstract
    This paper defines a new problem - heavyweight pattern mining in attributed flow graphs. The problem can be described as the discovery of patterns in flow graphs that have sets of attributes associated with their nodes. A connection between nodes is represented as a directed edge. The amount of load that goes through a path between nodes, or the frequency of transmission of such load between nodes, is represented as edge weights. A heavyweight pattern is a sub-set of attributes, found in a dataset of attributed flow graphs, that are connected by edges and have a computed weight higher than an user-defined threshold. A new algorithm called AFG Miner is introduced, the first one to our knowledge that finds heavyweight patterns in a dataset of attributed flow graphs and associates each pattern with its occurrences. The paper also describes a new tool for compiler engineers, HEP Miner, that applies the AFG Miner algorithm to Profile-based Program Analysis modeled as a heavyweight pattern mining problem.
  • Keywords
    data flow graphs; data mining; directed graphs; AFG Miner algorithm; HEP Miner; attributed flow graphs; compiler engineers; directed edge; heavyweight pattern mining; profile-based program analysis; user-defined threshold; Algorithm design and analysis; Complexity theory; Data mining; Electronic mail; Flow graphs; Labeling; Servers; data mining; flow graphs; graph mining; pattern mining; program analysis; program profiling; software analysis; sub-graph mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2014 IEEE International Conference on
  • Conference_Location
    Shenzhen
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4799-4303-6
  • Type

    conf

  • DOI
    10.1109/ICDM.2014.51
  • Filename
    7023408