Sign in to view source links and access this dataset
Description
A challenge dataset for multi-cloud Site Reliability Engineering (SRE) fault troubleshooting based on a real e-commerce microservices system spanning three clouds (Alibaba Cloud, Tencent Cloud, AWS). The dataset contains structured case data with fault phenomena, injection scripts, recovery scripts, ideal answers, and scoring rubrics. It was created by author 'kluoms' and last updated on Hugging Face in May 2026.
Use Cases
Training SREs on multi-cloud fault diagnosis based on structured case scenarios.
Benchmarking automated incident response systems using the provided injection and recovery scripts.
Developing educational tools for cloud operations based on the detailed ideal answers and scoring rubrics.
Strengths
Based on a real e-commerce microservices system spanning three major cloud providers.
Includes structured components like fault phenomena, scripts, ideal answers, and scoring rubrics.
Limitations
Description metadata is limited; actual data quality requires manual inspection after download.
Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.
Provenance
Source
huggingface
Collection Method
Likely derived from real or simulated multi-cloud SRE operations.
Freshness
Last updated 2026-05-27 14:02:23; freshness should be verified.
License restrictions are unknown and should be verified before use.