TY - GEN
T1 - Discovering and Visualizing Operations Processes with POD-Discovery and POD-Viz
AU - Weber, Ingo
AU - Li, Chao
AU - Bass, Len
AU - Xu, Xiwei
AU - Zhu, Liming
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2015/9/14
Y1 - 2015/9/14
N2 - Understanding the behavior of an operations process and capturing it as an abstract process model has been shown to improve dependability significantly [1]. In particular, process context can be used for error detection, diagnosis, and even automated recovery. Creating the process model is an essential step in determining process context and, consequently, improving dependability. This paper describes two systems. The first, POD-Discovery, simplifies the creation of such an abstract process model from operations logs. An activity that previously required many manual steps can now be done largely automatically and in minutes. Using the discovered model, the second system, POD-Viz, provides operators with the ability to visualize the current state of an operations process in near-real-time and to replay a set of events to understand how the process context changed over time. This allows operators to trace the progress of an operations process easily, and helps in analyzing encountered errors.
AB - Understanding the behavior of an operations process and capturing it as an abstract process model has been shown to improve dependability significantly [1]. In particular, process context can be used for error detection, diagnosis, and even automated recovery. Creating the process model is an essential step in determining process context and, consequently, improving dependability. This paper describes two systems. The first, POD-Discovery, simplifies the creation of such an abstract process model from operations logs. An activity that previously required many manual steps can now be done largely automatically and in minutes. Using the discovered model, the second system, POD-Viz, provides operators with the ability to visualize the current state of an operations process in near-real-time and to replay a set of events to understand how the process context changed over time. This allows operators to trace the progress of an operations process easily, and helps in analyzing encountered errors.
KW - Cloud Computing
KW - Dependability
KW - Monitoring
KW - Process Modelling
KW - System Administration
KW - System Operation
UR - http://www.scopus.com/inward/record.url?scp=84950151621&partnerID=8YFLogxK
U2 - 10.1109/DSN.2015.23
DO - 10.1109/DSN.2015.23
M3 - Conference contribution
AN - SCOPUS:84950151621
T3 - Proceedings of the International Conference on Dependable Systems and Networks
SP - 537
EP - 544
BT - Proceedings - 2015 45th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, DSN 2015
PB - IEEE Computer Society
T2 - 45th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, DSN 2015
Y2 - 22 June 2015 through 25 June 2015
ER -