Talks and Poster Presentations (with Proceedings-Entry):
P Tummeltshammer, A. Steininger:
"Power Supply Induced Common Cause Faults - Experimental Assessment of Potential Countermeasures";
Talk: DSN 2009 (International Conference on Dependable Systems and Networks),
- 2009-07-02; in: "DSN 2009 - Full Program",
Fault-tolerant architectures based on physical replication of components are vulnerable to faults that cause the same effect in all replica. Short outages in a power supply shared by all replica are a prominent example for such common cause faults. For systems in which the provision of a replicated power supply would cause prohibitive efforts the identification of reliable countermeasures against these effects is vital to maintain the required dependability level. In this paper we propose several of such countermeasures, namely parity protection, voltage monitoring and time diversity of the replica. We perform extensive fault injection experiments on three fault-tolerant dual core processor designs, one FPGA based and two commercial ASICs. These experiments provide evidence for the vulnerability of a completely unprotected dual core solution, while time diversity and voltage monitoring in combination with increased timing margins turn out particularly effective for eliminating common cause effects.
Power Supply, Common Cause Faults, Experimental Assessment
Project Head Andreas Steininger:
Created from the Publication Database of the Vienna University of Technology.