Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
41,488 datasets
Weighted clearance rates for police-reported crimes in Ontario, based on the principles of the Police Reported Crime Severity Index. The data is compiled by the Ministry of the Solicitor General and can be accessed from Statistics Canada. The dataset was last updated on April 17, 2026.
Geoscience Australia's analysis of sulfide and alteration mineral chemistry across Australia's largest under-explored province reveals distinct differences between mineralization styles. This pre-competitive geoscience data, presented at the 2025 SEG Conference, was produced by a collaboration including the Exploring for the Future program and the Resourcing Australia's Prosperity initiative. The study spans five Australian states and aims to support sustainable resource management and mineral exploration.
Geoscience Australia, AIMS, and the Northern Territory Government collected benthic sediment oxygen demand measurements in inner Darwin Harbour and shallow waters around Bynoe Harbour between May 29 and June 19, 2017. This dataset is part of a four-year (2014-2018) science program to establish baseline data for habitat mapping and marine resource management. The project was led by the Northern Territory Government and supported by the INPEX-led Ichthys LNG Project.
BC Municipal Solid Waste Disposal Rates contains estimates of the per-capita amount, in kilograms, of municipal solid waste disposed of in British Columbia, by regional district, from 1990 to 2023. The data were collected by the Ministry of Environment and the BC Climate Action Secretariat from regional districts, with some estimates provided by ministry staff. The dataset is published by the Government of British Columbia.
VIIRS/NPP VNP17A2GF Version 2 provides gap-filled 8-day composites of Gross Primary Productivity and Net Photosynthesis at a 500-meter spatial resolution. The dataset is a Level 4 product generated annually using a radiation use efficiency model, with poor-quality inputs from Leaf Area Index and Fraction of Photosynthetically Active Radiation data cleaned via linear interpolation. It serves as an input for models calculating terrestrial energy, carbon, water cycle processes, and vegetation biogeochemistry.
Yearly composites from 2012 onward provide global land surface data on actual and potential evapotranspiration (ET/PET) and latent heat flux (LE/PLE) at a 500-meter resolution. The dataset is derived from NOAA-20 VIIRS satellite data and meteorological reanalysis using a Penman-Monteith algorithm, with gap-filling applied to vegetation inputs for quality. It is produced annually, not in near-real-time, and includes quality control layers.
VIIRS/JPSS1 VJ117A3GF Version 2 provides yearly, gap-filled composites of Gross and Net Primary Production (GPP/NPP) for global vegetation monitoring. Its 500-meter spatial resolution and Level 4 processing, which includes linear interpolation for poor-quality LAI/FPAR inputs, offer a cleaned data product for modeling terrestrial energy and biogeochemical cycles. However, users cannot access this dataset in near-real-time, as it is generated only at the end of each calendar year.
An 8-day composite dataset from 2012 onward provides gap-filled estimates of actual and potential evapotranspiration (ET) and latent heat flux (LE) at a 500-meter pixel resolution. The product, VNP16A2GF, is generated annually using the Penman-Monteith equation, which integrates VIIRS vegetation data and meteorological reanalysis. It includes five core variables: ET, PET, LE, PLE, and a quality control layer, with pixel values representing 8-day water loss summation or daily average energy.
NASA/NOAA Suomi NPP VIIRS Actual and Potential Evapotranspiration (VNP16A2) is an 8-day composite dataset at 500-meter resolution. The algorithm uses the Penman-Monteith equation, integrating daily meteorological reanalysis with VIIRS-derived vegetation and albedo data. It provides layers for actual and potential evapotranspiration (ET, PET) and latent heat flux (LE, PLE), along with a quality control variable.
Daily global albedo data at 1-kilometer resolution is produced using 16 days of VIIRS observations from the Suomi NPP satellite, weighted to the ninth day. The dataset provides 36 science variables, including black-sky and white-sky albedo for nine VIIRS moderate bands and three broadbands, using the RTLSR kernel-driven BRDF model. It is designed to continue the legacy of NASA's MODIS BRDF/Albedo product suite for monitoring land surface radiative properties.
Global daily land surface reflectance data at 1-kilometer resolution is provided by the Suomi NPP satellite's VIIRS instrument. The VNP43IA4 product corrects for view-angle effects using a 16-day rolling window and the RossThick/Li-Sparse-Reciprocal BRDF model to produce stable, nadir-adjusted reflectance values. It includes 18 science data layers for nine spectral bands and is designed for continuity with NASA's MODIS BRDF/Albedo product suite.
NASA/NOAA's Suomi NPP VIIRS VNP43IA4 product provides daily, 500-meter resolution Nadir Bidirectional Reflectance Distribution Function (BRDF) Adjusted Reflectance (NBAR) estimates. It is generated daily using a 16-day data window, applying the RossThick/Li-Sparse-Reciprocal BRDF model to correct for view-angle effects and produce stable surface reflectance for VIIRS imagery bands I1, I2, and I3. The dataset supports continuity with MODIS products and enables the calculation of black-sky and white-sky albedo.
2.5 MB of user-generated text from Reddit communities related to sleep and mental health, supporting the SleepDepNet multi-task learning framework. The dataset, created by Akshi Kumar and last updated on 2026-05-07, is licensed under CC-BY-4.0 and available in CSV format. It was used to train models achieving F1-scores of 0.89 for sleep quality classification and 0.86 for depressive sentiment analysis.
Takashi Ida's dataset supports a study on heteranthery's ecological function. It contains measurements of bee pollinator traits, anther and pollen characteristics, and plant traits for Lagerstroemia indica, all collected in 2023. The data is stored in three Excel workbooks totaling 173.3 KB.
A 2019-2022 survey acquired by the NSW government's Department of Planning and Environment onboard the Research Vessel Bombora. It provides 5-meter resolution 32-bit floating point geotiff files of bathymetry and backscatter for the Forster Pacific Palms Cape Hawke, NSW area, processed using Hypack, R2Sonic GUI, and Qimera software. The dataset was created to establish a baseline and map the spatial distribution of seabed types.
Reduced radiometric point-located data measures gamma-ray emissions from potassium, uranium, and thorium in the Earth's surface. The dataset was acquired in 2024 by the WA Government and consists of 415,090 line-kilometres of data. It has been processed with noise filtering, background corrections, and levelling techniques.
Geoscience Australia Data provides a series of 1:1,000,000 lithofacies maps of continental shelf sediments. The maps result from systematic reconnaissance geological surveys initiated by the Bureau of Mineral Resources following a 1967 monograph. Three map sheets covering Rowley Shoals, Scott Reef, and the Arafura Sea were printed by early 1974.
Six sedimentary cycles, each hundreds of metres thick, are documented for the Surat Basin in Australia. The data, provided by Geoscience Australia, describes cycles from the Jurassic and Cretaceous periods, correlating them with global sea-level oscillations. The record was last updated on 2026-05-14.
5.5 KB of data supporting a study on optimal seamline detection for Synthetic Aperture Radar (SAR) image mosaicking. The dataset, authored by Dong Yan and last updated in May 2026, is associated with a method using superpixel segmentation and region merging to improve geometric and radiometric consistency in stitched images. It likely contains results from comparative experiments evaluating obstacle-avoidance capability and computational efficiency against classical methods.
A small, 5.5 KB dataset containing quantitative metric results from a study proposing an optimal seamline detection method for Synthetic Aperture Radar (SAR) images. The dataset, authored by Dong Yan and last updated in May 2026, compares the proposed superpixel segmentation and region merging approach against two classical methods. The results likely include metrics related to computational efficiency and mosaicking quality.