Terraform infrastructure-as-code files are the subject of this dataset for predicting software defects. It likely contains metrics and labels for files to support machine learning models in software quality analysis. The dataset is published on Kaggle, but its specific size, origin, and creation date are not provided.
Use Cases
- Training a classifier to predict defective Terraform configuration files (inferred from domain, verify after download)
- Analyzing code metrics correlated with defects in infrastructure-as-code (inferred from domain, verify after download)
- Benchmarking defect prediction models for declarative programming languages (inferred from domain, verify after download)
Strengths
- Published on Kaggle, a major platform for sharing datasets.
Limitations
- Metadata is minimal; actual content requires verification after download.
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, which may limit suitability assessment.