Tag Archives: reliability

ImmunoPlane: Middleware for Providing Adaptivity to Distributed Internet-of-Things Applications

Kumseok Jung, Gargi Mitra, Sathish Gopalakrishnan and Karthik Pattabiraman, To appear in the Proceedings of the ACM/IEEE Conference on Internet of Things Design and Implementation (IoTDI), 2024. (Acceptance Rate: TBD) [ PDF (coming soon) | Talk] (code)
Continue reading

Characterizing and Improving Resilience of Accelerators to Memory Errors in Autonomous Robots

Deval Shah, Zi Yu Xue, Karthik Pattabiraman and Tor Aamodt, To appear in the ACM Transactions on Cyber-Physical Systems (TCPS). (Acceptance date: Sep 2023). [PDF] (arXIV)
Continue reading

Evaluating the Effect of Common Annotation Faults on Object Detection Techniques

Abraham Chan, Arpan Gujarati, Karthik Pattabiraman and Sathish Gopalakrishnan, Proceedings of the IEEE International Symposium on Software Reliability Engineering (ISSRE), 2023. (Acceptance Rate: 28.5%) [ PDF | Talk ] (Code). Artifacts Available and Reviewed.

Continue reading

Resilience Assessment of Large Language Models under Transient Hardware Faults

Udit Agarwal, Abraham Chan, and Karthik Pattabiraman, Proceedings of the IEEE International Symposium on Software Reliability Engineering (ISSRE), 2023. (Acceptance Rate: 28.5%) [ PDF | Talk ] (Code). Artifacts Available and Reviewed.
Continue reading

Mixed Precision Support in HPC Applications: What About Reliability?

Alessio Netti, Yang Peng, Patrik Omland, Michael Paulitsch, Jorge Parra, Gustavo Espinosa, Udit Agarwal, Abraham Chan, and Karthik Pattabiraman, Journal of Parallel and Distributed Computing (JPDC). [ PDF ] (code)
Continue reading

Structural Coding: A Low-Cost Scheme to Protect CNNs from Large-Granularity Memory Faults

Ali Asgari, Florian Geissler, Syed Qutub, Michael Paulitsch, Prashant Nair, and Karthik Pattabiraman, Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis (SC), 2023. (Acceptance Rate: 23.9%) [ PDF | Talk ] (code). Artifacts Available and Functional
Continue reading

A Low-cost Strategic Monitoring Approach for Scalable and Interpretable Error Detection in Deep Neural Networks

Florian Geissler, Syed Qutub, Michael Paulitsch and Karthik Pattabiraman, Proceedings of the International Conference on Computer Safety, Reliability and Security (SafeComp), 2023. (Acceptance Rate: 20%) [PDF | Talk]
Continue reading

LLTFI: Framework Agnostic Fault Injection for Machine Learning Applications (Tools and Artifact Track)

Udit Agarwal, Abraham Chan, and Karthik Pattabiraman, IEEE International Symposium on Software Reliability Engineering (ISSRE), 2022. (Acceptance Rate: 29%) [ PDF | Talk (video) ] (Code)
Continue reading

Fault Injection for TensorFlow Applications

Niranjhana Narayanan, Zitao Chen, Bo Fang, Guanpeng Li, Karthik Pattabiraman, and Nathan DeBardeleben, IEEE Transactions on Dependable and Secure Computing (TDSC). Acceptance Date: May 2022. [ PDF ] (code1, code2)
Continue reading

The Fault in Our Data Stars: Studying Mitigation Techniques against Faulty Training Data in ML Applications

Abraham Chan, Arpan Gujarati, Karthik Pattabiraman, and Sathish Gopalakrishnan. IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), 2022. (Acceptance rate: 18.7%) [ PDF | Talk ] (Code)
Continue reading