• TensorFI: A Flexible Fault Injection Framework for TensorFlow Applications

    standard
  • Improving the Accuracy of IR-Level Fault Injection

    standard
  • A Tale of Two Injectors: End-to-End Comparison of IR-level and Assembly-Level Fault Injection

    standard
  • BonVoision: Leveraging Spatial Data Smoothness for Recovery from Memory Soft Errors

    standard
  • LetGo: A Lightweight Continuous Framework for HPC Applications Under Failures

    standard
  • ePVF: An Enhanced Program Vulnerability Factor Methodology for Cross-Layer Resilience Analysis

    standard
  • A Systematic Methodology for Evaluating the Error Resilience of GPGPU Applications

    standard
  • Talk: Tolerating Silent Data Corruption (SDC) causing Hardware Faults Through Software Techniques

    standard
  • Evaluating the Error Resilience of Parallel Programs

    standard
  • GPGPUs: How to Combine High Computational Power with High Reliability

    standard