Tag Archives: Bo

A Tale of Two Injectors: End-to-End Comparison of IR-level and Assembly-Level Fault Injection

Lucas Palazzi, Guanpeng Li, Bo Fang, and Karthik Pattabiraman, IEEE International Symposium on Software Reliability Engineering (ISSRE), 2019. (Acceptance Rate: 31.4%) [ PDF | Talk ] (code)
Continue reading

BonVoision: Leveraging Spatial Data Smoothness for Recovery from Memory Soft Errors

Bo Fang, Hassan Halawa, Karthik Pattabiraman, Matei Ripeanu and Sriram Krishnamurthy, , Proceedings of the ACM International Conference on Supercomputing (ICS), 2019. (Acceptance Rate: 23.2 %). [ PDF | Talk ]
Continue reading

LetGo: A Lightweight Continuous Framework for HPC Applications Under Failures

Bo Fang, Qiang Guan, Nathan Debardeleben, Karthik Pattabiraman, and Matei Ripeanu, ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC), 2017. (Acceptance Rate: 19%) [ PDF | Talk ]

Continue reading

ePVF: An Enhanced Program Vulnerability Factor Methodology for Cross-Layer Resilience Analysis

Bo Fang, Qining Lu, Karthik Pattabiraman, Matei Ripeanu, and Sudhanva Gurumurthi, IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), 2016 (Acceptance rate: 21%). [ PDF | Talk ]
Continue reading

A Systematic Methodology for Evaluating the Error Resilience of GPGPU Applications

Bo Fang, Karthik Pattabiraman, Matei Ripeanu and Sudhanva Gurumurthi, IEEE Transactions on Parallel and Distributed Systems (TPDS), 2016. [ PDF ]
Continue reading

Talk: Tolerating Silent Data Corruption (SDC) causing Hardware Faults Through Software Techniques

Talk given in the Electrical and Computer Engineering Department, Georgia Tech, June 2014. (Also, at IBM T. J. Watson Research, Aug 2014) [ PDF ].
Continue reading

Evaluating the Error Resilience of Parallel Programs

Bo Fang, Karthik Pattabiraman, Matei Ripeanu and Sudhanva Gurumurthi, To appear in the Workshop on Fault Tolerance for HPC at Extreme Scale (FTXS), 2014. Co-located with the DSN 2014 conference. [ PDF | Talk ]
Continue reading

GPGPUs: How to Combine High Computational Power with High Reliability

Leonardo Bautista-Gomez, Franck Cappello, Luigi Carro, Nathan DeBardeleben, Bo Fang, Sudhanva Gurumurthi, Karthik Pattabiraman, Paolo Rech, Embedded tutorial, International Symposium on Design, Automation & Test in Europe (DATE’14), Dresden, Germany. [ Paper | Talk ]
Continue reading

GPU-Qin: A Methodology for Evaluating the Error Resilience of GPGPU Applications

Bo Fang, Karthik Pattabiraman, Matei Ripeanu and Sudhanva Gurumurthi, To appear in the proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2014, Monterrey, CA. (Acceptance rate: 30 %) [PDF | Talk ]
Continue reading

Towards Building Error Resilient GPGPU Applications

Bo Fang, Jiesheng Wei, Karthik Pattabiraman and Matei Ripeanu,
Workshop on Error Resilient Architectures (WRA), held in conjunction with Micro 2012. [ PDF File | Talk Slides ]
Continue reading