Peng Zhou published a dataset on figshare in April 2026 comparing the predictive performance of different machine learning models on training and test sets. The dataset is 5.5 KB in size and is available as an XLS file under a CC-BY-4.0 license.
Use Cases
- Compare model performance metrics between training and test sets based on the dataset's title
- Demonstrate overfitting or generalization patterns in educational settings based on the dataset's title
- Teach principles of model evaluation and validation using a concrete example
Strengths
- Dataset is openly licensed under CC-BY-4.0, allowing for reuse and redistribution
- File size is 5.5 KB, indicating a small, easily downloadable resource
Limitations
- Row count is unknown, which may limit suitability assessment
- Column-level documentation is absent; field semantics must be inferred after download
Provenance
- Source
- figshare
- Freshness
- Last updated 2026-04-24 17:43:47; freshness should be verified