General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
167,348 datasets
Fourteen male elite speed climbing athletes performed single-leg squat jumps and a 15-meter speed climb. Surface electromyography and vertical ground reaction force data were collected synchronously to compute co-activation indices and asymmetry. The dataset was authored by Sukwon Kim and last updated on June 30, 2026.
The PRISM benchmark dataset supports research on detecting generated images. It contains photorealistic synthesized and manipulated images, with corresponding real images sourced from the COCO 2017 Train and Validation sets. The dataset was created by oppiliF and is associated with a 2026 research paper.
Fall 2007 and spring 2009 Antarctic sea ice data from the SIMBA program, collected during the Nathaniel B. Palmer icebreaker drift with a buoy network. Measurements include ice thickness, temperature profiles, large-scale deformation, and other sea ice characteristics. The dataset is provided by NASA.
A 2026 investigation by the Government of Yukon assessed the quality and quantity of aggregate resources at four sites in the Whitehorse area. The project aimed to support local development by maximizing resource extraction near existing infrastructure. The data likely contains geophysical and borehole information for the McLean Lake, Takhini Bridge, Haeckel Hill, and Long Lake sites.
Graduation counts from the University of Cauca, disaggregated by academic period, faculty, and program level. The dataset includes columns for specific semesters from 2018-1 to 2025-2, suggesting a time-series structure. It is hosted by the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
A 150-km-long belt of mica-quartz schist and olivine serpentinites wedged between the Yukon-Tanana Terrane and the Insular Superterrane in southwest Yukon. The dataset likely contains geological and geospatial data on these 'Alpine-type' ultramafic rocks, interpreted as fragments of oceanic crust from a Late Cretaceous back-arc basin. It is provided by the Government of Yukon on the open_canada platform.
Southern Tay River map area (NTS 105K) stratigraphy data describes the Upper Triassic to Lower Jurassic Faro Peak formation. The Government of Yukon published this geological description, which details two lithologically distinct members with thicknesses of ~650 meters and >800 meters. The dataset was last updated on 2026-04-17.
Three suites of intrusions, including 10–200 m thick mafic sills and 2–3 m wide alkaline dikes, are documented within Paleoproterozoic to Paleozoic sedimentary strata. The dataset describes northwest-verging folding and thrusting in the Wernecke Supergroup and gentle folds in younger groups. This geological update was published by the Government of Yukon on April 17, 2026.
The Government and Municipalities of Québec provide a dataset of electric vehicle charging stations. The data is available in CSV format under a CC-BY-4.0 license. The dataset was last updated on 2026-04-22.
Land use data details the distribution of areas for natural, improved, and cut pastures, as well as forage crops and silvopastoral systems across municipalities in the Sucre department of Colombia. The dataset includes surface area in hectares for each pasture type and municipality, along with subregion classification. It was published on the Colombian open data portal, datos.gov.co, and was last updated on 2026-05-18.
0.6 x 0.6 meter resolution canopy height estimates for mangrove forests across three sites in southeastern Mozambique. The National Aeronautics and Space Administration produced this dataset from WorldView-1 stereo images processed with the Ames Stereo Pipeline in September 2012. It provides a very high-resolution digital surface model for a specific region and time period.
An ensemble of three machine learning models and a generalized additive model predicts daily nitrogen dioxide levels at 1-km resolution from 2000 to 2016. The modeling framework incorporates satellite column concentrations, land-use data, meteorological variables, and outputs from the chemical transport models GEOS-Chem and CMAQ. This high-resolution dataset supports research into the short- and long-term health effects of air pollution across the contiguous United States.
Quebec municipal day camp statistics from 2013, archived on open_canada. The dataset contains operational, attendance, and staffing information aggregated by district. It was last updated on the platform in May 2026 and is provided by the Government and Municipalities of Québec under a CC-BY-4.0 license.
Four datasets from different geographical locations used to analyze machine learning algorithms for wind power prediction. Author Usman Ali processed the data, removing outliers with Z-score and IQR methods; the dataset was last updated on April 30, 2026. It is a small dataset (5.5 KB) stored in XLS format.
Usman Ali published a dataset on 2026-04-30 for analyzing machine learning algorithms in wind power forecasting. The data, stored in an XLS file of 5.5 KB, originates from four different geographical locations and has been processed to remove outliers. It was used to evaluate XGBoost, Random Forest Regression, and Support Vector Regression models for predicting wind power output.
Four datasets from different geographical locations contain meteorological and power data for wind energy forecasting. The data was used in a study by Usman Ali, last updated in April 2026, to evaluate machine learning algorithms like XGBoost and Support Vector Regression. The file is 5.5 KB in size and is available in XLS format.
5.5 KB of data from four wind farm locations used to benchmark machine learning models for power forecasting. Author Usman Ali published the dataset on figshare in 2026, comparing XGBoost, Random Forest, and Support Vector Regression algorithms. The study reports high accuracy, with R² values of 0.99 for top-performing models across most sites.
31 in-depth interviews with primary health care providers and District Health Team members from two rural districts in Uganda. The data was thematically analyzed using the Consolidated Framework for Implementation Research (CFIR) to inform the design of a mobile application for dementia care. The dataset was authored by Edith K. Wakida and last updated on 2026-04-30.
Hamed Tabesh published a dataset on 2026-04-30 containing results from a study forecasting daily Emergency Department patient arrivals. The data likely includes performance metrics for ARIMA, ANN, LSTM, GLM, and two hybrid forecasting algorithms, incorporating meteorological and calendar influences. The dataset is 5.5 KB in size and is available in XLS format under a CC-BY-4.0 license.
A 9.5 KB dataset supporting a study on forecasting daily Emergency Department patient arrivals. The data likely contains daily arrival counts alongside meteorological and calendar variables used to develop and compare ARIMA, ANN, LSTM, GLM, and hybrid forecasting models. Hamed Tabesh published the dataset on figshare under a CC-BY-4.0 license, with a last update timestamp of 2026-04-30.