The concrete_compressive_strength dataset was curated by the TabArena team for evaluating regression models on independent and identically distributed tabular data. It originates from a 1998 study by Yeh, I-C., published in Cement and Concrete Research. The dataset's intended task is to predict the compressive strength of high-performance concrete.
Use Cases
- Benchmarking regression algorithms based on the described task of predicting concrete compressive strength.
- Evaluating model performance on independent and identically distributed (IID) tabular data as per the dataset's stated focus.
- Studying relationships between concrete mixture components and material strength based on the original research context.
Strengths
- Dataset has a clear, peer-reviewed source from a 1998 Cement and Concrete Research publication.
- Licensed under CC BY 4.0, permitting sharing and adaptation with attribution.
- Curated specifically for a defined machine learning study on IID tabular regression tasks.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, which may limit suitability assessment for large-scale training.
- Last update date is unknown; freshness unverified.
Provenance
- Source
- Yeh, I-C. 'Modeling of strength of high-performance concrete using artificial neural networks.' Cement and Concrete research 28.12 (1998): 1797-1808.
- Collection Method
- Curated from an original data source (https://doi.org/10.24432/C5PK67) by the TabArena team.
- Time Range
- 1998
- Freshness
- Dataset Year is 1998; last updated date is unknown.
- Geography
- null