Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
145,729 datasets
Acceleration data from 19 low-cost, bottom-mounted 'Mini Buoy' floats deployed at three sites in the Mekong Delta, Vietnam. The data were collected between 25 June and 31 August 2022 to identify hydrological tipping points at eroding and expanding mangrove forest edges. The dataset is hosted by the Environmental Information Data Centre and includes gravity-compensated acceleration values in g-force at sub-minute intervals.
Caerlaverock saltmarsh in Scotland provides raw acceleration data and images from 20 monitoring locations spaced at approximately 500-meter intervals. Data was collected by Mini Buoy sensors at 1-second or 10-second intervals between March 2021 and 2022, with coordinates and marsh edge status recorded for each location. The Environmental Information Data Centre produced this dataset to identify hydrological thresholds influencing saltmarsh expansion or erosion.
A dataset detailing the educational programs offered by the certified training schools of the Colombian National Navy. It includes 10 columns such as 'Nombre programa' (program name), 'Modalidad' (modality), and 'Nivel académico' (academic level). The data was published by datos.gov.co and was last updated on 2026-05-18.
Global daily gridded data provides erythemally weighted ultraviolet (UVB) dose and dose rate at local solar noon. Measurements are derived from the Aura-OMI satellite at a 1.0x1.0 degree spatial resolution. Each file contains data from the sunlit portion of the globe and is stored in the HDF-EOS5 format.
NASA's Surface Temperatures from UNL Data Set contains ground-based radiant temperature measurements collected during the FIFE experiment. Data was gathered at three specific sites over a four-week period from July 15 to August 11, 1989, using an Everest multiplexed infrared thermometer. Measurements were coordinated with aircraft and satellite overpasses to study flux variability related to topography, vegetation, and land management.
Sheel Chandra published summary statistics for fitted regression models on four species, including silkworm, on figshare in June 2026. The data includes the proportion of variance explained, calculated from model deviances. The file is a 10.5 KB XLSX spreadsheet.
174 survey responses from nurses on pain management. Descriptive statistics cover knowledge, attitudes, collaboration with physicians, self-efficacy, and practices. The dataset was authored by Phichpraorn Youngcharoen and last updated on June 1, 2026.
A dataset listing the most visited manta ray watching dive and snorkel sites as ranked by tour operators. It includes mean numbers of tourists, tourist vessels, and manta rays observed per trip, categorized by the northeast or southwest monsoon seasons. The dataset was authored by Hannah M. Moloney and last updated on 2026-06-01.
An Origin project file contains the stress and damage distribution for concrete-filled square steel tube members under axial tension. The data illustrates the influence of slenderness ratio on axial tensile performance through load–strain curves. Author Hongtao Zhang published this 105.6 KB file under a CC-BY-4.0 license on figshare.
An Origin project file for the tensile analysis of concrete-filled square steel tube members with different cross-sectional sizes. The file contains the stress distribution of the outer steel tube, damage distribution of the concrete core, and load–strain curves for the corresponding numerical models. It was authored by Hongtao Zhang and last updated on 2026-06-01.
Hongtao Zhang's project file contains data for the tensile analysis of concrete-filled square steel tube members. The 105.6 KB OPJU file includes stress distributions, concrete core damage patterns, and load–strain curves used to evaluate confinement effects. It was last updated on June 1, 2026.
Mid-May through mid-October 1987 data collected from 21 stations across 19 sitegrids in the FIFE study area. This derived dataset compiles original surface flux measurements, including latent and sensible heat flux, with flagged spikes and checks for energy imbalances. It was compiled by NASA and compares observed flux data to model results.
Data from the SWOT mission's Poseidon-3C altimeter, launched December 16, 2022, provides measurements along the satellite's ground track. The dataset contains sea surface height, significant wave height, and wind speed measurements at two sampling resolutions: approximately 6-km at 1Hz and 300-m at 20Hz. It is processed using restituted auxiliary data and Precise Orbit Ephemeris (POE) and is distributed in netCDF-4 format with a latency of less than 90 days.
Data from the SWOT mission's Poseidon-3C altimeter, launched December 16, 2022, provides measurements along the satellite's ground track. It records sea surface height, significant wave height, and wind speed at two sampling resolutions: approximately 6-km at 1Hz and 300-m at 20Hz. The dataset is processed with precise orbit data and distributed in netCDF-4 format with a latency of less than 90 days.
Information assets from the National School of Sport in Colombia, published as part of the E-Government Strategy. The dataset includes columns for language, information categories, descriptions, titles, availability, formats, and storage media. It was last updated on 2026-05-18 18:42:29 and is hosted on the Colombian open data portal www.datos.gov.co.
Frédéric Lesné's work evaluates major types of corruption indicators used for macroeconomic research since the mid-1990s. The study focuses on multi-year indicators with global or regional coverage that provide comparable country scores over time. The dataset is hosted on figshare by Issakha THIAM under a GPL 2.0+ license and was last updated in June 2026.
Multimodal data from 108 patients were collected for a study comparing machine learning and deep learning models for predicting radiation-induced oral mucositis. The dataset includes CT imaging, dose distribution, and clinical features, and was published by Ling Li on figshare in April 2026. It is a small-cohort dataset of 137.2 KB, intended for radiomics and dosimetric analysis.
A dataset of multimodal data from 108 patients, collected to evaluate machine learning models for predicting radiation-induced oral mucositis. The data includes CT imaging, dose distribution, and clinical features. It was authored by Ling Li and last updated on 2026-04 09.
Multimodal data from 108 patients, including CT imaging, dose distribution, and clinical features, used to predict radiation-induced oral mucositis. The dataset was created by Ling Li and published on figshare in April 2026. It contains radiomic features extracted for a comparative study of machine learning and deep learning model performance.
787.1 KB of data from a study investigating deep neural networks as surrogate models for dam-break flows through vegetation. The dataset, authored by Shunsuke Iwasaki and last updated in May 2026, contains results from three-dimensional OpenFOAM simulations validated against laboratory experiments. It was generated to enable rapid assessment of vegetation effects on flooding for nature-based protection strategies.