• DocumentCode
    13547
  • Title

    A Framework for the Analysis of Process Mining Algorithms

  • Author

    Weber, Piotr ; Bordbar, Behzad ; Tino, Peter

  • Author_Institution
    Sch. of Comput. Sci., Univ. of Birmingham, Birmingham, UK
  • Volume
    43
  • Issue
    2
  • fYear
    2013
  • fDate
    Mar-13
  • Firstpage
    303
  • Lastpage
    317
  • Abstract
    There are many process mining algorithms and representations, making it difficult to choose which algorithm to use or compare results. Process mining is essentially a machine learning task, but little work has been done on systematically analyzing algorithms to understand their fundamental properties, such as how much data are needed for confidence in mining. We propose a framework for analyzing process mining algorithms. Processes are viewed as distributions over traces of activities and mining algorithms as learning these distributions. We use probabilistic automata as a unifying representation to which other representation languages can be converted. We present an analysis of the Alpha algorithm under this framework and experimental results, which show that from the substructures in a model and behavior of the algorithm, the amount of data needed for mining can be predicted. This allows efficient use of data and quantification of the confidence which can be placed in the results.
  • Keywords
    Petri nets; business data processing; data mining; probabilistic automata; Alpha algorithm; Petri nets; business process; data mining; machine learning task; probabilistic automata; process mining algorithms; representation languages; unifying representation; Algorithm design and analysis; Business; Data mining; Machine learning; Machine learning algorithms; Petri nets; Process control; Business processes; Petri nets; machine learning; probabilistic automata; process mining;
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics: Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2168-2216
  • Type

    jour

  • DOI
    10.1109/TSMCA.2012.2195169
  • Filename
    6202711