TY - GEN
T1 - Data transformation and semantic log purging for process mining
AU - Ly, Linh Thao
AU - Indiono, Conrad
AU - Mangler, Jürgen
AU - Rinderle-Ma, Stefanie
N1 - Funding Information:
The work presented in this paper has been partly conducted within the project I743-N23 funded by the Austrian Science Fund (FWF).
PY - 2012
Y1 - 2012
N2 - Existing process mining approaches are able to tolerate a certain degree of noise in the process log. However, processes that contain infrequent paths, multiple (nested) parallel branches, or have been changed in an ad-hoc manner, still pose major challenges. For such cases, process mining typically returns spaghetti-models, that are hardly usable even as a starting point for process (re-)design. In this paper, we address these challenges by introducing data transformation and pre-processing steps that improve and ensure the quality of mined models for existing process mining approaches. We propose the concept of semantic log purging, the cleaning of logs based on domain specific constraints utilizing semantic knowledge which typically complements processes. Furthermore we demonstrate the feasibility and effectiveness of the approach based on a case study in the higher education domain. We think that semantic log purging will enable process mining to yield better results, thus giving process (re-)designers a valuable tool.
AB - Existing process mining approaches are able to tolerate a certain degree of noise in the process log. However, processes that contain infrequent paths, multiple (nested) parallel branches, or have been changed in an ad-hoc manner, still pose major challenges. For such cases, process mining typically returns spaghetti-models, that are hardly usable even as a starting point for process (re-)design. In this paper, we address these challenges by introducing data transformation and pre-processing steps that improve and ensure the quality of mined models for existing process mining approaches. We propose the concept of semantic log purging, the cleaning of logs based on domain specific constraints utilizing semantic knowledge which typically complements processes. Furthermore we demonstrate the feasibility and effectiveness of the approach based on a case study in the higher education domain. We think that semantic log purging will enable process mining to yield better results, thus giving process (re-)designers a valuable tool.
KW - Data transformation
KW - Log purging
KW - Process constraints
KW - Process mining
UR - http://www.scopus.com/inward/record.url?scp=84867799450&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-31095-9_16
DO - 10.1007/978-3-642-31095-9_16
M3 - Conference contribution
AN - SCOPUS:84867799450
SN - 9783642310942
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 238
EP - 253
BT - Advanced Information Systems Engineering - 24th International Conference, CAiSE 2012, Proceedings
T2 - 24th International Conference on Advanced Information Systems Engineering, CAiSE 2012
Y2 - 25 June 2012 through 29 June 2012
ER -