• DocumentCode
    3008179
  • Title

    A principled approach to mining from noisy logs using Heuristics Miner

  • Author

    Weber, Piotr ; Bordbar, Behzad ; Tino, Peter

  • Author_Institution
    Sch. of Comput. Sci., Univ. of Birmingham, Birmingham, UK
  • fYear
    2013
  • fDate
    16-19 April 2013
  • Firstpage
    119
  • Lastpage
    126
  • Abstract
    Noise is a challenge for process mining algorithms, but there is no standard definition of noise nor accepted way to quantify it. This means it is not possible to mine with confidence from event logs which may not record the underlying process correctly. We discuss one way of thinking about noise in process mining. We consider mining from a `noisy log´ as learning a probability distribution over traces, representing the true process, from a log which is a sample from multiple distributions: the `true´ process model and one or more `noise´ models. We apply this using a probabilistic analysis of the Heuristics Miner algorithm, and demonstrate on a simple example. We show that for a given model it is possible to predict how much data is needed to mine the underlying model without the noise, and identify differences in the the robustness of Heuristics Miner to different types of noise.
  • Keywords
    data mining; learning (artificial intelligence); probability; event logs; heuristics miner algorithm; noisy logs; principled approach; probabilistic analysis; probability distribution; process mining algorithms; Algorithm design and analysis; Business; Data mining; Joints; Noise; Noise measurement; Probabilistic logic;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Data Mining (CIDM), 2013 IEEE Symposium on
  • Conference_Location
    Singapore
  • Type

    conf

  • DOI
    10.1109/CIDM.2013.6597226
  • Filename
    6597226