Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
44,716 datasets
Colombian air quality data from the La Candelaria station in the Mamonal Industrial sector for January 2022. The dataset includes parameters such as PM10 concentration, wind speed, atmospheric pressure, and solar radiation. It is published by www.datos.gov.co and was last updated on 2026-05-18.
A 2021 survey from October 10 to November 8 acquired bathymetric data for the Australian Hydrographic Office. The dataset provides separate 0.5m and 1m resolution grids for two sites, processed from Kongsberg EM2040D multibeam echosounder data. It was created by Geoscience Australia for calibrating multibeam echosounders under the Hydroscheme Industry Partnership Program.
A collection of 945 uniquely annotated HTTP Archive (HAR) files captured during controlled sessions with three conversational AI platforms. The dataset supports the Human-AI-Platform Attribution Framework and includes artefacts from seven forensic scenarios, such as multi-turn conversations and adversarial prompts. It was authored by Prathmesh Pawar and last updated on June 4, 2026.
A 2023 bathymetry surface created from a national reference survey acquired on 12 Dec 2021 for the Australian Hydrographic Office. The data consists of separate 0.5m resolution grids for two sites near Goods Island and Tuesday Island, provided in MSL, LAT, and Ellipsoid vertical datums. It was processed using QPS Qimera software from data collected via a Kongsberg EM2040D multibeam echosounder.
140 students, lecturers, and researchers from 17 African countries participated in a stratified purposive survey assessing awareness, perceptions, and barriers to synthetic biology. The dataset, created by Erikan Baluku and last updated in April 2026, includes both qualitative and quantitative data from the study. An online training course was piloted, with participants reporting improved understanding of the field.
The dataset contains Level 2 surface reflectance and aerosol parameters over land, and Level 1B top-of-atmosphere reflectance, from the European Space Agency's Sentinel-3B satellite. Data is acquired by the Ocean and Land Colour Instrument (OLCI) and the Sea and Land Surface Temperature Radiometer (SLSTR), with a ground spatial resolution of around 300 meters and a 1270km swath. The data is provided in netCDF 4 format via the aws_open_data platform under a CC-BY-4.0 license.
The NASA Sentinel-3A Project provides Level 2 surface reflectance and aerosol parameters over land, generated by combining data from the OLCI and SLSTR instruments on the Sentinel-3 satellite. The dataset includes 29 measurement data files and 9 annotation data files, with surface reflectances provided on a 300-meter resolution grid. It is hosted on aws_open_data and is licensed under CC-BY-4.0.
EMIT is an imaging spectrometer on the International Space Station measuring surface mineralogy in Earth's arid dust source regions between 52° N and 52° S latitude. The Level 1B data product provides at-sensor calibrated radiance across 285 spectral bands from 381 to 2493 nanometers at a 60-meter spatial resolution. Each data granule covers an area of approximately 75 by 75 kilometers.
A 0.5m resolution bathymetry survey of three sites in Cairns, Queensland, acquired on 27 May 2020. The surface was created from a contracted national reference survey for the Australian Hydrographic Office to calibrate multibeam echosounders. Data is provided as 32-bit floating point GeoTIFF grids in MSL, LAT, and Ellipsoid vertical datums.
Two high-resolution bathymetric surfaces were created from a 2020 survey in Gulf St Vincent, South Australia, for calibrating multibeam echosounders. The Australian Hydrographic Office contracted the survey, which used Kongsberg EM 2040 and EA440 echosounders and was processed with Caris HIPS & SIPS. Separate 1-meter resolution GeoTIFF grids are provided for two sites in MSL, LAT, and Ellipsoid vertical datums.
A catalog of X-ray luminosities for 401 early-type galaxies, with 136 based on new ROSAT PSPC observations and the rest compiled from literature. The data was used to analyze the L_X/L_B relation and the influence of environment and discrete sources on galaxy X-ray emission. The sample was selected from the LEDA archive with criteria on morphology, recession velocity, and apparent magnitude.
Performance results for three natural language processing models: DistilRoBERTa, BiGRU, and a hybrid DistilRoBERTa-BiGRU. The dataset was authored by Shoukat Ullah and last updated on May 26, 2026. It is stored in an XLS file with a size of 5.5 KB.
13.5 KB Excel file comparing sentiment analysis techniques and outcomes, authored by Shoukat Ullah and last updated on May 26, 2026. The dataset is shared under a CC-BY-4.0 license on the figshare platform.
Hayato Harima published ANOVA and Tukey test results for viral loads in mice infected with recombinant viruses. The 11.6 KB Excel file contains statistical comparisons for viruses with S46N/S49Y or T54A mutations in the σA protein. The dataset was last updated on May 26, 2026.
A 15.2 KB Excel file contains calculated LC50 values, knockdown rates, and mortality for female Aedes aegypti mosquitoes across seven generations (G0–G6). Mosquitoes were treated with three concentrations of cypermethrin in each generation. The dataset was authored by Han-Hsuan Chung and last updated on May 26, 2026.
Tatiane C. M. Sousa published a dataset on figshare in May 2026. The dataset likely contains simple and thematic indicators and indexes used to construct a vulnerability index. The dataset is 5.5 KB in size and is available in XLS format.
Survey data on general opinion and recommendation of vaccines, authored by Noelia Rodríguez-Blanco. The dataset is stored in an XLS file of 9.5 KB and was last updated on May 26, 2026.
13.5 KB of tabular data comparing general characteristics between reoperated and non-reoperated groups. The dataset is provided by author Yee Ran Lyu and was last updated on May 26, 2026. It is available as an Excel file under a CC-BY-4.0 license.
A pharmacological study from Sungkyunkwan University investigates the antihyperglycemic effect of Prunus amygdalus extract on Streptozotocin-induced diabetic rats. The dataset likely contains experimental results including blood glucose levels, biochemical parameters, and antioxidant activity percentages. It was authored by Richa Sachan and is available via paperswithcode.
The dataset presents the distribution of metric variables used for sex estimation in Italian and Austrian samples. The Italian sample was used to develop models based on gender information, while the Austrian sample was used for validation based on morphological sex estimation. The dataset is 17.5 KB in size, authored by Lukas Waltenberger, and was last updated on 2026-05-05.