Tag Archives: Bo

TensorFI: A Flexible Fault Injection Framework for TensorFlow Applications

Zitao Chen, Niranjhana Narayanan, Bo Fang, Guanpeng Li, Karthik Pattabiraman, and Nathan DeBardeleben, IEEE International Symposium on Software Reliability Engineering (ISSRE), 2020. (Acceptance Rate: 26%) [ PDF | Talk ] (Code)
Continue reading

Comments Off on TensorFI: A Flexible Fault Injection Framework for TensorFlow Applications

Filed under papers

Improving the Accuracy of IR-Level Fault Injection

Lucas Palazzi, Guanpeng Li, Bo Fang, and Karthik Pattabiraman, IEEE Transactions on Dependable and Secure Computing (TDSC). (Acceptance date: March 2020). [PDF] (Code)
Continue reading

Comments Off on Improving the Accuracy of IR-Level Fault Injection

Filed under papers

A Tale of Two Injectors: End-to-End Comparison of IR-level and Assembly-Level Fault Injection

Lucas Palazzi, Guanpeng Li, Bo Fang, and Karthik Pattabiraman, IEEE International Symposium on Software Reliability Engineering (ISSRE), 2019. (Acceptance Rate: 31.4%) [ PDF | Talk ] (code)
Continue reading

Comments Off on A Tale of Two Injectors: End-to-End Comparison of IR-level and Assembly-Level Fault Injection

Filed under papers

BonVoision: Leveraging Spatial Data Smoothness for Recovery from Memory Soft Errors

Bo Fang, Hassan Halawa, Karthik Pattabiraman, Matei Ripeanu and Sriram Krishnamurthy, , Proceedings of the ACM International Conference on Supercomputing (ICS), 2019. (Acceptance Rate: 23.2 %). [ PDF | Talk ]
Continue reading

Comments Off on BonVoision: Leveraging Spatial Data Smoothness for Recovery from Memory Soft Errors

Filed under papers

LetGo: A Lightweight Continuous Framework for HPC Applications Under Failures

Bo Fang, Qiang Guan, Nathan Debardeleben, Karthik Pattabiraman, and Matei Ripeanu, ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC), 2017. (Acceptance Rate: 19%) [ PDF | Talk ]

Continue reading

Comments Off on LetGo: A Lightweight Continuous Framework for HPC Applications Under Failures

Filed under papers

ePVF: An Enhanced Program Vulnerability Factor Methodology for Cross-Layer Resilience Analysis

Bo Fang, Qining Lu, Karthik Pattabiraman, Matei Ripeanu, and Sudhanva Gurumurthi, IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), 2016 (Acceptance rate: 21%). [ PDF | Talk ]
Continue reading

Comments Off on ePVF: An Enhanced Program Vulnerability Factor Methodology for Cross-Layer Resilience Analysis

Filed under papers

A Systematic Methodology for Evaluating the Error Resilience of GPGPU Applications

Bo Fang, Karthik Pattabiraman, Matei Ripeanu and Sudhanva Gurumurthi, IEEE Transactions on Parallel and Distributed Systems (TPDS), 2016. [ PDF ]
Continue reading

Comments Off on A Systematic Methodology for Evaluating the Error Resilience of GPGPU Applications

Filed under papers

Talk: Tolerating Silent Data Corruption (SDC) causing Hardware Faults Through Software Techniques

Talk given in the Electrical and Computer Engineering Department, Georgia Tech, June 2014. (Also, at IBM T. J. Watson Research, Aug 2014) [ PDF ].
Continue reading

Comments Off on Talk: Tolerating Silent Data Corruption (SDC) causing Hardware Faults Through Software Techniques

Filed under Talks

Evaluating the Error Resilience of Parallel Programs

Bo Fang, Karthik Pattabiraman, Matei Ripeanu and Sudhanva Gurumurthi, To appear in the Workshop on Fault Tolerance for HPC at Extreme Scale (FTXS), 2014. Co-located with the DSN 2014 conference. [ PDF | Talk ]
Continue reading

Comments Off on Evaluating the Error Resilience of Parallel Programs

Filed under papers

GPGPUs: How to Combine High Computational Power with High Reliability

Leonardo Bautista-Gomez, Franck Cappello, Luigi Carro, Nathan DeBardeleben, Bo Fang, Sudhanva Gurumurthi, Karthik Pattabiraman, Paolo Rech, Embedded tutorial, International Symposium on Design, Automation & Test in Europe (DATE’14), Dresden, Germany. [ Paper | Talk ]
Continue reading

Comments Off on GPGPUs: How to Combine High Computational Power with High Reliability

Filed under papers