Tag Archives: Cloud

Failure Prediction of Jobs in Compute Clouds: A Google Cluster Case Study

Xin Chen, Charng-da Lu and Karthik Pattabiraman, to appear in the Workshop on Reliability and Security Data Analysis (RSDA), held in conjunction with the IEEE International Symposium on Software Reliability Engineering (ISSRE), 2014. [ PDF | Talk ]

Filed under papers

Failure Analysis of Jobs in Compute Clouds: A Google Cluster Case Study

Xin Chen, Charng-da Lu and Karthik Pattabiraman, 25th IEEE International Symposium on Software Reliability Engineering (ISSRE), 2014. (Acceptance rate: 25%). [ PDF | Talk ] Download dataset. (This paper was chosen as one of the “highlights of ISSRE” in 2019, one of 26 papers selected from over 1000 in the 30-year history of the conference) (ECE story).

Predicting Job Completion Times Using System Logs in Supercomputing Clusters

Xin Chen, Charng-da Lu and Karthik Pattabiraman, Proceedings of the IEEE Workshop on Reliability and Security Data Analysis, 2013. [ PDF File | Talk ]