Matthew Herbert Ning published this dataset on figshare in April 2026. It contains Gini importance scores from a final decision tree model used in a cross-cohort experiment involving LMFP ratios. The dataset is a single 8.4 KB XLSX file.
Use Cases
- Ranking feature contributions based on the reported Gini importance scores.
- Comparing feature relevance for model interpretability in the described experiment.
- Validating or auditing the decision tree model's feature selection process.
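The ranking use case above can be sketched as follows. This is a minimal illustration, not the author's method. The column names `feature` and `gini_importance` and the sample rows are hypothetical, since the file's actual schema is undocumented. Rows could plausibly be obtained with `pandas.read_excel(path).to_dict("records")`, assuming pandas and an XLSX engine such as openpyxl are installed.

```python
def rank_features(rows, feature_key="feature", score_key="gini_importance"):
    """Return (feature, score, share) tuples sorted by descending score.

    `feature_key` and `score_key` are hypothetical column names; adjust
    them to whatever headers the downloaded spreadsheet actually uses.
    """
    pairs = [(str(row[feature_key]), float(row[score_key])) for row in rows]
    total = sum(score for _, score in pairs)
    # sorted() is stable, so tied features keep their original order.
    ranked = sorted(pairs, key=lambda pair: pair[1], reverse=True)
    return [
        (name, score, score / total if total else 0.0)
        for name, score in ranked
    ]


# Hypothetical rows for illustration only; these are not values from the dataset.
example_rows = [
    {"feature": "ratio_a", "gini_importance": 0.25},
    {"feature": "ratio_b", "gini_importance": 0.5},
    {"feature": "ratio_c", "gini_importance": 0.25},
]

for name, score, share in rank_features(example_rows):
    print(f"{name}: score={score:.2f}, share of total={share:.0%}")
```

The share column is included because Gini importances from a single tree are often normalized to sum to 1; whether the published scores are normalized should be checked against the file itself.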
Strengths
- Dataset is small (8.4 KB), facilitating quick download and inspection.
- Published under the CC-BY-4.0 license, which permits reuse and modification provided attribution is given.
- Provides a specific model interpretability metric (Gini importance) from a defined experiment.
Limitations
- Row count is not reported in the metadata, so the dataset's scope cannot be assessed before download.
- Column-level documentation is absent; field semantics must be inferred after download.
- Description metadata is limited; actual data quality requires manual inspection after download.
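Because row count and column semantics are undocumented, a first inspection pass after download might look like the sketch below. It assumes the sheet has already been read into a list of dictionaries (one per row); the helper name `profile_rows` and the sample rows are hypothetical and do not reflect the file's real contents.

```python
def _is_missing(value):
    """Treat None, empty strings, and float NaN as missing."""
    if value is None or value == "":
        return True
    return isinstance(value, float) and value != value  # NaN is not equal to itself


def profile_rows(rows):
    """Summarize row count, column names, and missing values per column."""
    rows = list(rows)
    columns = []
    for row in rows:
        for key in row:
            if key not in columns:  # preserve first-seen column order
                columns.append(key)
    missing = {
        col: sum(1 for row in rows if _is_missing(row.get(col)))
        for col in columns
    }
    return {"row_count": len(rows), "columns": columns, "missing": missing}


# Hypothetical rows for illustration only; these are not values from the dataset.
sample_rows = [
    {"feature": "ratio_a", "gini_importance": 0.25},
    {"feature": "ratio_b", "gini_importance": None},
]

print(profile_rows(sample_rows))
```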
Provenance
- Source: figshare
- Collection Method: Likely derived from a machine learning experiment analyzing LMFP ratios across cohorts.
- Time Range: Not specified
- Freshness: Last updated 2026-04-30 17:31:42; check the figshare record for later revisions.
- Geography: Not specified