Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
159,442 datasets
1.2 MB of geospatial data outputs from research analyzing global drought risk in cities. The datasets were created by author Tristian Stolte and last updated on 2026-05-29. They are shared under a CC-BY-4.0 license as a ZIP file on figshare.
Matrículas - Instituciones Educativas Garzón Huila provides detailed 2024 enrollment figures for schools in the municipality of Garzón, Huila, Colombia. The dataset includes counts by educational level and institutional details, sourced from the Colombian open data portal datos.gov.co. It was last updated on 2026-05-18.
A 79.7 MB ZIP file containing supplementary data for a study on S-palmitoylation. The dataset, authored by Qi Liang and last updated on May 22, 2026, is licensed under CC-BY-4.0 and hosted on figshare. It likely contains experimental results related to the regulation of mitochondria-associated endoplasmic reticulum membrane function.
A 22.7 KB Excel file containing supporting data for a study on S-palmitoylation's role in regulating mitochondria-associated endoplasmic reticulum membrane function to alleviate nucleus pulposus cell senescence. The dataset was authored by Qi Liang and last updated on May 22, 2026. It is shared under a CC-BY-4.0 license on figshare.
Wenxin Jiang published a dataset on figshare in May 2026 detailing frequent adverse events in a clinical trial. The dataset lists grade 3 or higher adverse events with an incidence greater than 5% for each treatment group. It is a 5.5 KB Excel file available under a CC-BY-4.0 license.
UK-based qualitative data from the HIIT or MISS clinical trial, consisting of themes and subthemes. The dataset was authored by Charlotte Williams and is available under a CC-BY-4.0 license. It was last updated on 29 May 2026.
A 5.5 KB Excel file containing data related to a hybrid forecasting method for energy storage power stations. The dataset likely contains results or inputs from a model integrating chaos theory, signal decomposition, and deep learning, optimized with an adaptive genetic algorithm. It was authored by Lingzhi Xi and last updated on April 24, 2026.
5.5 KB of data supporting a paper proposing a hybrid forecasting method for day-ahead power generation at energy storage stations. The dataset, authored by Lingzhi Xi and last updated in April 2026, likely contains analysis results from experiments using operational data from a 10 MW/20 MWh electrochemical energy storage power station. The proposed model achieved a Mean Squared Error of 6.41 MW² and a Coefficient of Determination (R²) of 0.898.
5.5 KB of experimental results from a hybrid forecasting model applied to a 10 MW/20 MWh electrochemical energy storage power station. The dataset, authored by Lingzhi Xi and uploaded to figshare in April 2026, contains performance metrics including Mean Squared Error (MSE), Mean Absolute Error (MAE), and Coefficient of Determination (R²) for day-ahead 24-hour power generation forecasts.
pre_data is a table of plume detection results, including time, latitude, longitude, emission flux, and 1σ uncertainty. The dataset was authored by Zunze Zhang and last updated on June 3, 2026. It is a small dataset, 13.6 KB in size, and is available in CSV format under a CC-BY-4.0 license.
Colombia's proactive information disclosure schema, published via datos.gov.co. The dataset catalogs published and planned information from public bodies, with columns for generation date, format, language, and update frequency. It was last updated on 2026-05-18.
Pranav Joshi's dataset contains qualitative research on digital transformation in the Indian advertising sector. The data is stored in a 1.1 MB PDF file and was last updated on June 4, 2026. It is shared under a CC-BY-4.0 license on the figshare platform.
A study proposing a dimensionless number for analyzing dynamic plastic deformation of clamped rectangular plates under underwater explosion loads. The dataset includes experimental data used to establish a correlation between the dimensionless plastic deformation and the proposed number. The 316.0 KB PDF file was authored by Weizheng Xu and last updated on 2026-05-18.
High capacity wells data from Prince Edward Island, Canada. The dataset is published by the Government of Prince Edward Island on the open_canada platform. It was last updated on 2026-06-10.
BMRS is a dataset of Bongard–Maximov problems for remote sensing, published on Preprints.org in June 2026. The dataset is authored by Nikita Firsov, Olga Terekhova, and colleagues. It was last updated on the Hugging Face platform on 2026-06-22.
Daily-updated dataset of arXiv papers from AI/ML and adjacent categories, enriched with LLM-derived signals. It includes a 0–100 importance score, topical/lab tags, a one-line takeaway, and dense full-page summaries for a selected subset. The dataset is published by author taesiri and was last updated on 2026-06-17.
curt is a machine-first programming language designed for AI agents with a focus on output-token cost. This dataset contains the complete evaluation record for language version 0.2, including benchmark suites, model-generated programs, and reference materials. The dataset was created by therikkening and was last updated on June 12, 2026.
Historical information on Colombian beneficiaries of international scholarship calls from 2018 to 2024, grouped by various demographic and program variables. The data is provided by www.datos.gov.co and was last updated on 2026-05-26. Columns suggest records for MODALIDAD, GÉNERO, PAÍS DE DESTINO, and ESTRATO SOCIOECONOMICO DE RESIDENCIA.
Submissions and evaluation results for the CADGenBench leaderboard. The dataset contains one row per submitted and evaluated entry, as read by the leaderboard table. It was created by HuggingAI4Engineering and last updated on June 10, 2026.
GNS3 file exports were created as part of a master's thesis at NTNU. The files can be downloaded and imported into GNS3 to extract and run the network topology used in the thesis titled 'IPsec tunnels between end user devices behind NAT'. The dataset was authored by Sindre Revheim Svellingen and last updated on 2026-06-21.