Sign in to view source links and access this dataset
Description
A 23.5 MB dataset related to the SAGE study's data extraction step, authored by Mathias Pietrancosta and last updated on June 3, 2026. The dataset is available under a CC-BY-4.0 license and includes files in PDF, XLSX, DOCX, CSV, and JSON formats.
Use Cases
Benchmark data extraction workflows based on the described SAGE study process.
Analyze data extraction outputs based on the available CSV and JSON file formats.
Study the structure of research data extraction based on the multi-format file collection.
Strengths
Dataset is 23.5 MB in size, indicating a non-trivial volume of content.
Available under a permissive CC-BY-4.0 license for open use.
Includes data in multiple structured formats (CSV, JSON, XLSX).
Limitations
Row count is unknown, which may limit suitability assessment.
Column-level documentation is absent; field semantics must be inferred after download.
Description metadata is limited; actual data quality requires manual inspection after download.
Provenance
Source
figshare
Freshness
Last updated 2026-06-03 19:54:47; freshness should be verified.
The 23.5 MB size suggests a small dataset; scale limitations may apply for large-scale ML tasks.