Habib Saissi, Stefan Winter, Oliver Schwan, Karthik Pattabiraman, and Neeraj Suri, IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), 2020. (Acceptance Rate: 16.5%).
Abstract: Modern computing systems typically relax execution determinism, for instance by allowing the CPU scheduler to interleave the execution of several threads. While beneficial for performance, execution non-determinism affects programs’ execution traces and hampers the comparability of repeated executions. We present TraceSanitizer, a novel approach for execution trace comparison in Error Propagation Analyses (EPA) of multi-threaded programs. TraceSanitizer can identify and compensate for non-determinism caused either by dynamic memory allocation or by non-deterministic scheduling. We formulate a condition under which TraceSanitizer is guaranteed to achieve a 0% false positive rate and automate its verification using Satisfiability Modulo Theory (SMT) solving techniques. TraceSanitizer is comprehensively evaluated using execution traces from the PARSEC and Phoenix benchmarks. In contrast with other approaches, TraceSanitizer eliminates false positives without increasing the false negative rate (for a specific class of programs), with reasonable performance overheads.
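To illustrate why non-determinism from dynamic memory allocation hampers trace comparison, the following is a minimal sketch and not TraceSanitizer's actual algorithm. It assumes a hypothetical trace format (one string per entry, with heap addresses written as hex literals) and renames each distinct address to a symbolic name in order of first appearance, so that two runs differing only in allocator-chosen addresses compare as equal. The names `sanitize` and `traces_match` are invented for this example, and non-determinism from scheduling is not handled here.

```python
import re

# Assumption: addresses appear in trace entries as hex literals such as 0x7f10a000.
ADDR = re.compile(r"0x[0-9a-fA-F]+")


def sanitize(trace):
    """Replace each distinct address with a symbolic name (p0, p1, ...)
    assigned in order of first appearance within the trace."""
    names = {}

    def rename(match):
        addr = match.group(0).lower()
        if addr not in names:
            names[addr] = f"p{len(names)}"
        return names[addr]

    return [ADDR.sub(rename, entry) for entry in trace]


def traces_match(golden, other):
    """Compare two traces after address renaming."""
    return sanitize(golden) == sanitize(other)


# Hypothetical traces: a fault-free run, a second fault-free run with
# different heap addresses, and a run in which a stored value was corrupted.
golden = ["malloc -> 0x7f10a000", "store 42, 0x7f10a000", "free 0x7f10a000"]
rerun = ["malloc -> 0x55d2c040", "store 42, 0x55d2c040", "free 0x55d2c040"]
faulty = ["malloc -> 0x55d2c040", "store 43, 0x55d2c040", "free 0x55d2c040"]

print(golden == rerun)               # naive comparison flags a spurious deviation
print(traces_match(golden, rerun))   # renamed traces agree
print(traces_match(golden, faulty))  # the corrupted store is still detected
```

In this sketch, a naive line-by-line comparison of `golden` and `rerun` would report a false positive, while the renamed traces agree and the genuine deviation in `faulty` remains visible.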