Tag Archives: Bo
TensorFI: A Flexible Fault Injection Framework for TensorFlow Applications
Zitao Chen, Niranjhana Narayanan, Bo Fang, Guanpeng Li, Karthik Pattabiraman, and Nathan DeBardeleben, IEEE International Symposium on Software Reliability Engineering (ISSRE), 2020. (Acceptance Rate: 26%) [ PDF | Talk ] (Code)
Filed under papers
Improving the Accuracy of IR-Level Fault Injection
Lucas Palazzi, Guanpeng Li, Bo Fang, and Karthik Pattabiraman, IEEE Transactions on Dependable and Secure Computing (TDSC). (Acceptance date: March 2020). [PDF] (Code)
Filed under papers
A Tale of Two Injectors: End-to-End Comparison of IR-level and Assembly-Level Fault Injection
Lucas Palazzi, Guanpeng Li, Bo Fang, and Karthik Pattabiraman, IEEE International Symposium on Software Reliability Engineering (ISSRE), 2019. (Acceptance Rate: 31.4%) [ PDF | Talk ] (Code)
Filed under papers
BonVoision: Leveraging Spatial Data Smoothness for Recovery from Memory Soft Errors
Bo Fang, Hassan Halawa, Karthik Pattabiraman, Matei Ripeanu and Sriram Krishnamurthy, Proceedings of the ACM International Conference on Supercomputing (ICS), 2019. (Acceptance Rate: 23.2%) [ PDF | Talk ]
Filed under papers
LetGo: A Lightweight Continuous Framework for HPC Applications Under Failures
Bo Fang, Qiang Guan, Nathan DeBardeleben, Karthik Pattabiraman, and Matei Ripeanu, ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC), 2017. (Acceptance Rate: 19%) [ PDF | Talk ]
Filed under papers
ePVF: An Enhanced Program Vulnerability Factor Methodology for Cross-Layer Resilience Analysis
Bo Fang, Qining Lu, Karthik Pattabiraman, Matei Ripeanu, and Sudhanva Gurumurthi, IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), 2016. (Acceptance Rate: 21%) [ PDF | Talk ]
Filed under papers
A Systematic Methodology for Evaluating the Error Resilience of GPGPU Applications
Bo Fang, Karthik Pattabiraman, Matei Ripeanu and Sudhanva Gurumurthi, IEEE Transactions on Parallel and Distributed Systems (TPDS), 2016. [ PDF ]
Filed under papers
Talk: Tolerating Silent Data Corruption (SDC) causing Hardware Faults Through Software Techniques
Talk given in the Electrical and Computer Engineering Department, Georgia Tech, June 2014. (Also, at IBM T. J. Watson Research, Aug 2014) [ PDF ].
Filed under Talks
Evaluating the Error Resilience of Parallel Programs
Bo Fang, Karthik Pattabiraman, Matei Ripeanu and Sudhanva Gurumurthi, To appear in the Workshop on Fault Tolerance for HPC at Extreme Scale (FTXS), 2014. Co-located with the DSN 2014 conference. [ PDF | Talk ]
Filed under papers
GPGPUs: How to Combine High Computational Power with High Reliability
Leonardo Bautista-Gomez, Franck Cappello, Luigi Carro, Nathan DeBardeleben, Bo Fang, Sudhanva Gurumurthi, Karthik Pattabiraman, Paolo Rech, Embedded tutorial, International Symposium on Design, Automation & Test in Europe (DATE’14), Dresden, Germany. [ Paper | Talk ]
Filed under papers