Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
44,398 datasets
Registration data for 16 product categories, including air conditioners, computers, and refrigerators, sold under Minimum Energy Performance Standards (MEPS) in Australia and New Zealand. The data is collected from suppliers and includes fields like brand name, model number, and output range. The dataset does not contain Energy Star ratings, as these products are not required to carry them.
United Kingdom digital vector boundaries for Local Planning Authorities as of January 2026. The data contains full-resolution boundaries clipped to the coastline and is provided by the Office for National Statistics. It was last updated on the platform on March 18, 2026.
Digital vector boundaries for Local Planning Authorities in the United Kingdom, effective January 2026. The boundaries are generalised to 20 meters and clipped to the coastline. The dataset is provided by the Office for National Statistics and was last updated on March 18, 2026.
Local Planning Authorities (January 2026) Boundaries UK BUC contains digital vector boundaries for Local Planning Authorities in the United Kingdom as of January 2026. The boundaries are in an Ultra Generalised (200m) format, clipped to the coastline. The dataset is provided by the Office for National Statistics and was last updated on 2026-03-18.
Approximately 14 orbital files per day provide merged cloud parameters from the OMI instrument on Aura and MODIS on Aqua, both satellites in the NASA A-Train constellation. This Level-2 swath product collocates daytime MODIS cloud statistics onto OMI's 13x24 km visible pixels for near-simultaneous, multi-spectral observation. It is designed for applications exploiting the synergy between the two instruments' complementary measurements.
OMPIXCORZ files contain ground locations of OMI pixel corners for the day lit portion of an orbit, approximately 53 minutes of data per file. This Level-2 product is stored in HDF-EOS5 format with an average file size of about 8 Mbytes, providing the geospatial footprint for each sensor observation. Its primary purpose is to enable visualization, area-based emission calculations, cross-platform pixel mapping, and validation studies for atmospheric data.
Automatically generated data for notices published in the Centralised Automated Information System ‘Electronic Public Procurement’ (CAIS GPP) from February 26 to March 11, 2026. The data is structured according to the Open Contracting Data Standard (OCDS) and is provided by the Data Department of Bulgaria's State e-Government Agency. Information on the publication rules is available via the Public Procurement Portal.
The OMVANC dataset provides selected GEOS-5 Forward Processing parameters co-located with the OMI/Aura VIS 1-Orbit L2 swath. It includes fields such as snow cover, sea ice cover, land cover, terrain height, a row anomaly flag, and pixel area to support atmospheric retrieval algorithms and related research. Data are provided in netCDF4 format, with each file approximately 45 MB in size and spatially aligned to the OMI UV-2 instrument's 13km x 24km nadir resolution.
Thirty-five in situ spectral reflectance measurements were collected from surface water at 24 unique sites in Louisiana's Atchafalaya Basin during October 2016. This dataset, produced by ORNL_CLOUD for NASA's Pre-Delta-X campaign, provides ground-truthing data for the AVIRIS-NG airborne instrument. It directly supports the calibration of algorithms for retrieving total suspended solids (TSS) from remote sensing data.
PREFIRE_SAT2_2B-SFC_COG19um contains Cloud-Optimized GeoTIFF files providing retrieved surface emissivity values for a spectral channel centered at about 18.6 µm. The data are derived from a push broom spectrometer with 63 channels measuring radiation from 5 to 53 µm aboard one of two CubeSats, aiming to fill knowledge gaps in the polar far-infrared emissions for the global energy budget. Retrieved values are computed via an Optimal Estimation method using spectral radiance, cloud mask, and meteorological auxiliary data, and are rendered onto a global grid with raster elements approximately 2.23 km wide.
The Ministry of Economic Affairs and Climate Policy requested an opinion on the 2020 expansion of the SDE+ scheme, the Netherlands' primary instrument for stimulating renewable energy since 2011. The dataset contains calculations from the Unprofitable Top (OT) models used to determine subsidy amounts for renewable energy and other CO2-reducing technologies under the new SDE++ scheme. The data is provided by the Ministry of the Interior and Kingdom Relations under a CC-BY-4.0 license.
The Climate and Energy Outlook (KEV) 2022 dataset from the Netherlands Environmental Assessment Agency (PBL) provides annual monitoring of climate policy progress. It contains projections for Dutch greenhouse gas emissions up to 2030, showing an expected reduction of 39-50% compared to 1990 levels. The data covers developments in energy supply, consumption, agriculture, and land use, connecting autonomous trends with international developments.
Near-real-time data from the Sentinel-5P satellite's TROPOMI instrument, typically available within three hours of measurement and archived for up to ten days. The Level-1B product provides calibrated radiance, irradiance, and engineering data from the shortwave infrared (SWIR, 2305nm-2385nm) band 7 detector at a spatial resolution of 5.5km x 21km. Data is stored in netCDF4 granules, with each file containing one orbit of information and being approximately 0.305 GB in size.
Sub-Saharan Africa burned areas were mapped monthly and annually for the year 2000 using SPOT-VEGETATION satellite imagery at 1 km resolution. The Global Burned Area 2000 (GBA2000) initiative, led by the European Commission's Joint Research Centre, produced this dataset. Burned pixels were identified using a classification tree algorithm applied to the near-infrared channel of the VGT sensor.
Records of individual health service provisions from general medicine first and emergency consultations and general dentistry first consultations. The dataset includes columns for diagnosis, age, sex, external cause, and administrative details like entity name and date. It is hosted by www.datos.gov.co and was last updated on 2026-05-18.
112 high-quality images curated from digital artist Sam Yang (SamDoesArt). This collection focuses on his distinct stylized watercolor, rim-lit, and illustrative aesthetic. The dataset was prepared by Junkyyyy for training custom text-to-image models.
Sentinel-5P's TROPOMI instrument provides near-infrared radiance data within three hours of measurement for rapid atmospheric assessment. Each ~0.650 GB netCDF4 file contains one orbital granule of calibrated radiance, irradiance, and engineering data from the 675nm to 775nm wavelength band. This near-real-time product is a joint initiative of the European Space Agency and the Kingdom of the Netherlands, processed by the Royal Netherlands Meteorological Institute.
Over 3,000 sediment samples from Geoscience Australia's MARS database are synthesized with a geomorphic features dataset to characterize the inter-reefal seabed, which comprises 95% of the Great Barrier Reef Marine Park area. This regional-scale dataset provides a quantitative analysis of surface sediment trends and seabed morphology, updating models since the 1960s-1980s. The synthesis was published by the Australian Ocean Data Network to support Marine Park management.
SWOT Level 2 Lake Single-Pass Vector Product (SWOT_L2_HR_LakeSP_D) provides geolocated surface water measurements for lakes and unclassified water bodies from the Ka-band Radar Interferometer (KaRIn) on the SWOT satellite. Each data granule contains three ESRI shapefiles reporting water surface elevation, surface area, quality indicators, and storage change estimates for features linked to a Prior Lake Database and unassigned water bodies. The product is designed for time series analyses, hydrologic modeling, and monitoring water level and area dynamics across inland water bodies.
Port Curtis Integrated Monitoring Program collected bioaccumulation data from oysters in the Inner Harbour over 11 years. Sensors deployed by the Australian Ocean Data Network gathered this environmental monitoring data from September 2007 to November 2018. The dataset likely contains measurements of contaminants absorbed by oysters.