DocumentCode
3008179
Title
A principled approach to mining from noisy logs using Heuristics Miner
Author
Weber, Piotr ; Bordbar, Behzad ; Tino, Peter
Author_Institution
Sch. of Comput. Sci., Univ. of Birmingham, Birmingham, UK
fYear
2013
fDate
16-19 April 2013
Firstpage
119
Lastpage
126
Abstract
Noise is a challenge for process mining algorithms, but there is no standard definition of noise nor accepted way to quantify it. This means it is not possible to mine with confidence from event logs which may not record the underlying process correctly. We discuss one way of thinking about noise in process mining. We consider mining from a `noisy log´ as learning a probability distribution over traces, representing the true process, from a log which is a sample from multiple distributions: the `true´ process model and one or more `noise´ models. We apply this using a probabilistic analysis of the Heuristics Miner algorithm, and demonstrate on a simple example. We show that for a given model it is possible to predict how much data is needed to mine the underlying model without the noise, and identify differences in the the robustness of Heuristics Miner to different types of noise.
Keywords
data mining; learning (artificial intelligence); probability; event logs; heuristics miner algorithm; noisy logs; principled approach; probabilistic analysis; probability distribution; process mining algorithms; Algorithm design and analysis; Business; Data mining; Joints; Noise; Noise measurement; Probabilistic logic;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence and Data Mining (CIDM), 2013 IEEE Symposium on
Conference_Location
Singapore
Type
conf
DOI
10.1109/CIDM.2013.6597226
Filename
6597226
Link To Document