TY - GEN
T1 - A Method for Debugging Process Discovery Pipelines to Analyze the Consistency of Model Properties
AU - Klinkmüller, Christopher
AU - Seeliger, Alexander
AU - Müller, Richard
AU - Pufahl, Luise
AU - Weber, Ingo
N1 - Publisher Copyright:
© 2021, Springer Nature Switzerland AG.
PY - 2021
Y1 - 2021
N2 - Event logs have become a valuable information source for business process management, e.g., when analysts discover process models to inspect the process behavior and to infer actionable insights. To this end, analysts configure discovery pipelines in which logs are filtered, enriched, abstracted, and process models are derived. While pipeline operations are necessary to manage log imperfections and complexity, they might, however, influence the nature of the discovered process model and its properties. Ultimately, not considering this possibility can negatively affect downstream decision making. We hence propose a framework for assessing the consistency of model properties with respect to the pipeline operations and their parameters, and, if inconsistencies are present, for revealing which parameters contribute to them. Following recent literature on software engineering for machine learning, we refer to it as debugging. From evaluating our framework in a real-world analysis scenario based on complex event logs and third-party pipeline configurations, we see strong evidence towards it being a valuable addition to the process mining toolbox.
AB - Event logs have become a valuable information source for business process management, e.g., when analysts discover process models to inspect the process behavior and to infer actionable insights. To this end, analysts configure discovery pipelines in which logs are filtered, enriched, abstracted, and process models are derived. While pipeline operations are necessary to manage log imperfections and complexity, they might, however, influence the nature of the discovered process model and its properties. Ultimately, not considering this possibility can negatively affect downstream decision making. We hence propose a framework for assessing the consistency of model properties with respect to the pipeline operations and their parameters, and, if inconsistencies are present, for revealing which parameters contribute to them. Following recent literature on software engineering for machine learning, we refer to it as debugging. From evaluating our framework in a real-world analysis scenario based on complex event logs and third-party pipeline configurations, we see strong evidence towards it being a valuable addition to the process mining toolbox.
KW - Discovery
KW - Process mining
KW - Uncertainty & sensitivity analysis
UR - http://www.scopus.com/inward/record.url?scp=85115126897&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-85469-0_7
DO - 10.1007/978-3-030-85469-0_7
M3 - Conference contribution
AN - SCOPUS:85115126897
SN - 9783030854683
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 65
EP - 84
BT - Business Process Management - 19th International Conference, BPM 2021, Proceedings
A2 - Polyvyanyy, Artem
A2 - Wynn, Moe Thandar
A2 - Van Looy, Amy
A2 - Reichert, Manfred
PB - Springer Science and Business Media Deutschland GmbH
T2 - 19th International Conference on Business Process Management, BPM 2021
Y2 - 6 September 2021 through 10 September 2021
ER -