A derived version of the Sentinel-2 Land Cover Dataset, precomputed and reformatted for direct use with the Clay foundation model for Earth observation. The dataset was prepared by author wtr001 and last updated on March 17, 2026. It is designed to bypass typical preprocessing steps like tiling during data loading for training or inference pipelines.
Use Cases
- Training the Clay foundation model for Earth observation based on precomputed Sentinel-2 imagery and labels.
- Conducting land cover segmentation inference using a preprocessed dataset format.
- Benchmarking geospatial AI models on a standardized, pre-tiled satellite imagery dataset.
Strengths
- Precomputed for the Clay model, eliminating standard preprocessing steps like tiling during data loading.
- Derived from the established Sentinel-2 Land Cover Dataset, suggesting a foundation in recognized satellite data.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count and file size are unknown, which may limit suitability assessment.
- Description metadata is limited; actual data quality requires manual inspection after download.
Provenance
- Source
- Derived from the Sentinel-2 Land Cover Dataset (Zarr Format).
- Collection Method
- Precomputed and reformatted for compatibility with the Clay foundation model.
- Time Range
- null
- Freshness
- Last updated 2026-03-17 20:22:41.
- Geography
- Platform tags suggest a US region focus, but the description does not specify spatial coverage.