Tag Archives: reliability

LetGo: A Lightweight Continuous Framework for HPC Applications Under Failures

Bo Fang, Qiang Guan, Nathan Debardeleben, Karthik Pattabiraman, and Matei Ripeanu, ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC), 2017. (Acceptance Rate: 19%) [ PDF | Talk ]

Filed under papers

One Bit is (Not) Enough: An Empirical Study of the Impact of Single and Multiple Bit-Flip Errors

Behrooz Sangchoolie, Karthik Pattabiraman, and Johan Karlsson, IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), 2017. (Acceptance Rate: 23%). [ PDF | Talk ]

IPA: Error Propagation Analysis of Multi-threaded Programs Using Likely Invariants

Abraham Chan, Stefan Winter, Habib Saissi, Karthik Pattabiraman and Neeraj Suri. Proceedings of the IEEE International Conference on Software Testing, Verification and Validation (ICST), 2017. (Acceptance Rate: 27%) [PDF | Talk]

Configurable Detection of SDC-Causing Errors in Programs

Qining Lu, Guanpeng Li, Karthik Pattabiraman, Meeta Gupta and Jude Rivers, ACM Transactions on Embedded Computing Systems (TECS). [ PDF ]

Understanding Error Propagation in GPGPU Applications

Guanpeng Li, Karthik Pattabiraman, Chen-Yong Cher and Pradip Bose, International Conference for High-Performance Computing, Storage and Networking (SC), 2016. (Acceptance Rate: 18%) [ PDF | Talk ] (Link to LLFI-GPU)

A Study of Causes and Consequences of JavaScript Bugs

Frolin Ocariza, Kartik Bajaj, Karthik Pattabiraman and Ali Mesbah, IEEE Transactions on Software Engineering (TSE), 2017. [ PDF ] (Bug Database)

Finding Resilience-Friendly Compiler Optimizations using Meta-Heuristic Search Techniques

Nithya Narayanamurthy, Karthik Pattabiraman and Matei Ripeanu. 12th European Dependable Computing Conference (EDCC), 2016. (Acceptance Rate: 41%) [ PDF | Talk ] (Best Paper Award (one of three)) (ece story)

FIDL: A Fault Injection Description Language for Compiler-based SFI Tools

Maryam Raiyat Aliabadi and Karthik Pattabiraman, International Conference on Computer Safety, Reliability and Security (SafeComp), 2016. (Acceptance Rate: 34%) [ PDF | Talk ]

ePVF: An Enhanced Program Vulnerability Factor Methodology for Cross-Layer Resilience Analysis

Bo Fang, Qining Lu, Karthik Pattabiraman, Matei Ripeanu, and Sudhanva Gurumurthi, IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), 2016 (Acceptance rate: 21%). [ PDF | Talk ]

A Systematic Methodology for Evaluating the Error Resilience of GPGPU Applications

Bo Fang, Karthik Pattabiraman, Matei Ripeanu and Sudhanva Gurumurthi, IEEE Transactions on Parallel and Distributed Systems (TPDS), 2016. [ PDF ]