TY - JOUR
T1 - MORPH
T2 - An Adaptive Framework for Efficient and Byzantine Fault-Tolerant SDN Control Plane
AU - Sakic, Ermin
AU - Deric, Nemanja
AU - Kellerer, Wolfgang
N1 - Publisher Copyright:
© 1983-2012 IEEE.
PY - 2018/10
Y1 - 2018/10
N2 - Current approaches to tackle the single point of failure in SDN entail a distributed operation of SDN controller instances. Their state synchronization process is reliant on the assumption of a correct decision-making in the controllers. Successful introduction of SDN in the critical infrastructure networks also requires catering to the issue of unavailable, unreliable (e.g. buggy), and malicious controller failures. We propose MORPH, a framework tolerant to unavailability and Byzantine failures, which distinguishes and localizes faulty controller instances and appropriately reconfigures the control plane. Our controller-switch connection assignment leverages the awareness of the source of failure to optimize the number of active controllers and minimize the controller and switch reconfiguration delays. The proposed re-Assignment executes dynamically after each successful failure identification. We require 2F{M}+F{A}+1 controllers to tolerate {M} malicious and F{A} availability-induced failures. After a successful detection of F{M} malicious controllers, MORPH reconfigures the control plane to require a single controller message to forward the system state. Moreover, we outline and present a solution to the practical correctness issues related to the statefulness of the distributed SDN controller applications, previously ignored in the literature. We base our performance analysis on a resource-Aware routing application, deployed in an emulated testbed comprising up to 16 controllers and up to 34 switches, so to tolerate up to 5 unique Byzantine and additional 5 availability-induced controller failures (a total of 10 unique controller failures). We quantify and highlight the dynamic decrease in the packet and CPU load and the response time after each successful failure detection.
AB - Current approaches to tackle the single point of failure in SDN entail a distributed operation of SDN controller instances. Their state synchronization process is reliant on the assumption of a correct decision-making in the controllers. Successful introduction of SDN in the critical infrastructure networks also requires catering to the issue of unavailable, unreliable (e.g. buggy), and malicious controller failures. We propose MORPH, a framework tolerant to unavailability and Byzantine failures, which distinguishes and localizes faulty controller instances and appropriately reconfigures the control plane. Our controller-switch connection assignment leverages the awareness of the source of failure to optimize the number of active controllers and minimize the controller and switch reconfiguration delays. The proposed re-Assignment executes dynamically after each successful failure identification. We require 2F{M}+F{A}+1 controllers to tolerate {M} malicious and F{A} availability-induced failures. After a successful detection of F{M} malicious controllers, MORPH reconfigures the control plane to require a single controller message to forward the system state. Moreover, we outline and present a solution to the practical correctness issues related to the statefulness of the distributed SDN controller applications, previously ignored in the literature. We base our performance analysis on a resource-Aware routing application, deployed in an emulated testbed comprising up to 16 controllers and up to 34 switches, so to tolerate up to 5 unique Byzantine and additional 5 availability-induced controller failures (a total of 10 unique controller failures). We quantify and highlight the dynamic decrease in the packet and CPU load and the response time after each successful failure detection.
KW - Byzantine fault tolerance
KW - SDN
KW - availability
KW - distributed control plane
KW - empirical study
KW - reliability
UR - http://www.scopus.com/inward/record.url?scp=85055034966&partnerID=8YFLogxK
U2 - 10.1109/JSAC.2018.2869938
DO - 10.1109/JSAC.2018.2869938
M3 - Article
AN - SCOPUS:85055034966
SN - 0733-8716
VL - 36
SP - 2158
EP - 2174
JO - IEEE Journal on Selected Areas in Communications
JF - IEEE Journal on Selected Areas in Communications
IS - 10
M1 - 8490892
ER -