TY - JOUR
T1 - Resilience Articulation Point (RAP)
T2 - Cross-layer dependability modeling for nanometer system-on-chip resilience
AU - Herkersdorf, Andreas
AU - Aliee, Hananeh
AU - Engel, Michael
AU - Glaß, Michael
AU - Gimmler-Dumont, Christina
AU - Henkel, Jörg
AU - Kleeberger, Veit B.
AU - Kochte, Michael A.
AU - Kühn, Johannes M.
AU - Mueller-Gritschneder, Daniel
AU - Nassif, Sani R.
AU - Rauchfuss, Holm
AU - Rosenstiel, Wolfgang
AU - Schlichtmann, Ulf
AU - Shafique, Muhammad
AU - Tahoori, Mehdi B.
AU - Teich, Jürgen
AU - Wehn, Norbert
AU - Weis, Christian
AU - Wunderlich, Hans Joachim
N1 - Funding Information:
This research is supported by the German Research Foundation (DFG) as part of the priority program “Dependable Embedded Systems” (SPP1500 – spp1500.itec.kit.edu). We would also like to thank all partners within the priority program for their input and feedback.
PY - 2014
Y1 - 2014
N2 - The Resilience Articulation Point (RAP) model aims at provisioning researchers and developers with a probabilistic fault abstraction and error propagation framework covering all hardware/software layers of a System on Chip. RAP assumes that physically induced faults at the technology or CMOS device layer will eventually manifest themselves as a single or multiple bit flip(s). When probabilistic error functions for specific fault origins are known at the bit or signal level, knowledge about the unit of design and its environment allow the transformation of the bit-related error functions into characteristic higher layer representations, such as error functions for data words, Finite State Machine (FSM) state, macro-interfaces or software variables. Thus, design concerns at higher abstraction layers can be investigated without the necessity to further consider the full details of lower levels of design. This paper introduces the ideas of RAP based on examples of radiation induced soft errors in SRAM cells, voltage variations and sequential CMOS logic. It shows by example how probabilistic bit flips are systematically abstracted and propagated towards higher abstraction levels up to the application software layer, and how RAP can be used to parameterize architecture-level resilience methods.
AB - The Resilience Articulation Point (RAP) model aims at provisioning researchers and developers with a probabilistic fault abstraction and error propagation framework covering all hardware/software layers of a System on Chip. RAP assumes that physically induced faults at the technology or CMOS device layer will eventually manifest themselves as a single or multiple bit flip(s). When probabilistic error functions for specific fault origins are known at the bit or signal level, knowledge about the unit of design and its environment allow the transformation of the bit-related error functions into characteristic higher layer representations, such as error functions for data words, Finite State Machine (FSM) state, macro-interfaces or software variables. Thus, design concerns at higher abstraction layers can be investigated without the necessity to further consider the full details of lower levels of design. This paper introduces the ideas of RAP based on examples of radiation induced soft errors in SRAM cells, voltage variations and sequential CMOS logic. It shows by example how probabilistic bit flips are systematically abstracted and propagated towards higher abstraction levels up to the application software layer, and how RAP can be used to parameterize architecture-level resilience methods.
UR - http://www.scopus.com/inward/record.url?scp=84901627755&partnerID=8YFLogxK
U2 - 10.1016/j.microrel.2013.12.012
DO - 10.1016/j.microrel.2013.12.012
M3 - Article
AN - SCOPUS:84901627755
SN - 0026-2714
VL - 54
SP - 1066
EP - 1074
JO - Microelectronics Reliability
JF - Microelectronics Reliability
IS - 6-7
ER -