• DocumentCode
    3008205
  • Title

    Wanna improve process mining results?

  • Author

    Bose, R. P. Jagadeesh Chandra ; Mans, Ronny S. ; van der Aalst, Wil M. P.

  • Author_Institution
    Eindhoven Univ. of Technol., Eindhoven, Netherlands
  • fYear
    2013
  • fDate
    16-19 April 2013
  • Firstpage
    127
  • Lastpage
    134
  • Abstract
    The growing interest in process mining is fueled by the increasing availability of event data. Process mining techniques use event logs to automatically discover process models, check conformance, identify bottlenecks and deviations, suggest improvements, and predict processing times. Lion´s share of process mining research has been devoted to analysis techniques. However, the proper handling of problems and challenges arising in analyzing event logs used as input is critical for the success of any process mining effort. In this paper, we identify four categories of process characteristics issues that may manifest in an event log (e.g. process problems related to event granularity and case heterogeneity) and 27 classes of event log quality issues (e.g., problems related to timestamps in event logs, imprecise activity names, and missing events). The systematic identification and analysis of these issues calls for a consolidated effort from the process mining community. Five real-life event logs are analyzed to illustrate the omnipresence of process and event log issues. We hope that these findings will encourage systematic logging approaches (to prevent event log issues), repair techniques (to alleviate event log issues) and analysis techniques (to deal with the manifestation of process characteristics in event logs).
  • Keywords
    business data processing; data mining; analysis techniques; automatic process model discovery; bottleneck identification; case heterogeneity; conformance checking; data quality; deviation identification; event data; event granularity; event log analysis; event log quality; imprecise activity names; imprecise timestamps; improvement suggestion; missing events; missing timestamps; process mining; processing time prediction; repair techniques; Business; Computational intelligence; Data mining; Educational institutions; Electronic mail; Information systems; Systematics; Data Cleansing; Data Quality; Event Log; Outliers; Preprocessing; Process Mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Data Mining (CIDM), 2013 IEEE Symposium on
  • Conference_Location
    Singapore
  • Type

    conf

  • DOI
    10.1109/CIDM.2013.6597227
  • Filename
    6597227