Supporting Undoability in Systems Operations

Ingo Weber, Hiroshi Wada, Alan Fekete, Anna Liu, Len Bass

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

9 Scopus citations

Abstract

When managing cloud resources, many administrators operate without a safety net. For instance, inadvertently deleting a virtual disk results in the complete loss of the contained data. The facility to undo a collection of changes, reverting to a previous acceptable state, is widely recognized as valuable support for dependability. In this paper, we consider the particular needs of system administrators who manage API-controlled resources, such as cloud resources on the IaaS level. In particular, we propose an approach based on an abstract model of the effects of each available operation. Using this model, we check the degree to which each operation is undoable. A positive outcome of this check means a formal guarantee that any sequence of calls to such operations can be undone. A negative outcome contains information on the properties preventing undoability, e.g., which operations are not undoable and why. At runtime, we can then warn a user who intends to use an irreversible operation; if undo is possible and desired, we apply an AI planning technique to automatically create a workflow that takes the system back to the desired earlier state. We demonstrate the feasibility and applicability of the approach with a prototypical implementation and a number of experiments.
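The idea in the abstract can be illustrated with a toy sketch. The following is not the authors' model or planner: the state representation, the four operation names, and the helpers `plan_undo` and `is_undoable` are all hypothetical, and a plain breadth-first search stands in for the AI planning technique the paper refers to. It only shows how an abstract model of operation effects might support both an undoability check and the generation of an undo workflow.

```python
from collections import deque

# Hypothetical toy model: a state is a frozenset of facts about resources.
# Each operation returns the successor state, or None if it is not applicable.
def create_disk(state):
    return state | {"disk"} if "disk" not in state else None

def delete_disk(state):
    # Deleting the disk also destroys its data; no operation here recreates data.
    if "disk" in state and "attached" not in state:
        return state - {"disk", "data"}
    return None

def attach_disk(state):
    if "disk" in state and "attached" not in state:
        return state | {"attached"}
    return None

def detach_disk(state):
    return state - {"attached"} if "attached" in state else None

OPERATIONS = {
    "create_disk": create_disk,
    "delete_disk": delete_disk,
    "attach_disk": attach_disk,
    "detach_disk": detach_disk,
}

def plan_undo(current, target, operations=OPERATIONS):
    """Breadth-first search for a shortest operation sequence from current to target.

    Stand-in for a real planner; returns a list of operation names or None.
    """
    queue = deque([(current, [])])
    seen = {current}
    while queue:
        state, path = queue.popleft()
        if state == target:
            return path
        for name, op in operations.items():
            nxt = op(state)
            if nxt is not None and nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, path + [name]))
    return None

def is_undoable(name, state, operations=OPERATIONS):
    """True if, after applying the operation in `state`, some plan leads back to `state`."""
    after = operations[name](state)
    return after is not None and plan_undo(after, state, operations) is not None

checkpoint = frozenset({"disk", "data"})
print(plan_undo(attach_disk(checkpoint), checkpoint))  # undo plan for an attach
print(is_undoable("delete_disk", checkpoint))          # data loss cannot be reversed
```

In this toy model, attaching a disk can be undone by a one-step plan, while deleting a disk that holds data cannot, because no modeled operation restores the data. This mirrors the virtual-disk example in the abstract; a check of this kind is what would let a tool warn the user before the irreversible call is made.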

Original language: English
Title of host publication: 27th Large Installation System Administration Conference, LISA 2013
Publisher: USENIX Association
Pages: 75-87
Number of pages: 13
ISBN (Electronic): 9781931971058
State: Published - 2013
Externally published: Yes
Event: 27th Large Installation System Administration Conference, LISA 2013 - Washington, United States
Duration: 3 Nov 2013 to 8 Nov 2013

Publication series

Name: 27th Large Installation System Administration Conference, LISA 2013

Conference

Conference: 27th Large Installation System Administration Conference, LISA 2013
Country/Territory: United States
City: Washington
Period: 3/11/13 to 8/11/13
