General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
167,986 datasets
A 1.0 GB repository enables minimal reproduction of experiments for a quaternion-based gait augmentation method. The files support work on the BUT, SIGNET, and 200GAIT datasets. Author Aleksander Sawicki published it on figshare in May 2026.
WorkCover Queensland Contract Disclosure Reporting lists all awarded contracts valued over $10,000. The dataset is published by WorkCover Queensland under a CC-BY-4.0 license and was last updated on May 23, 2026. It provides structured records of government procurement activities.
Supplementary tables for a manuscript on Late Neoarchean geological processes. The dataset includes whole-rock geochemistry, mercury concentration and isotopic composition for Archean igneous samples from the Eastern Block of the North China Craton, along with summary literature data and reference material analyses. It was authored by Runsheng Yin and last updated on May 10, 2026.
2001 to 2010 monthly estimates of hillslope cover erosion measured in tonnes per hectare per month over New South Wales. The data was produced by the NSW Department of Climate Change, Energy, the Environment and Water and is available under a CC-BY-4.0 license. The dataset was last updated on the platform in May 2026.
Lena Vila Vilardell published data on the minor impacts of prescribed burning on black pine needle terpenes and pine processionary moth larval survival. The dataset is 27.6 MB in size and was last updated on May 24, 2026. It is available in CSV and TXT formats under a CC-BY-4.0 license.
The Korean Time-Sensitive Q&A Dataset (KoTSQA) is provided here as a split version for machine learning, with 6,750 test examples and 750 train examples intended for reinforcement learning. The dataset was created by ETRI LIRS and last updated on 2026-05-29.
HTMD tutorial data is a 5.2 GB collection for use with the HTMD software. The data is intended for Markov State Model analysis and Adaptive Sampling simulations. It was authored by Stefan Doerr and last updated on June 2, 2026.
A dataset of Extended Time-to-Collision (ETTC) ranges for longitudinal and lateral vehicle conflicts, derived from 12 interchange diverging areas on two multilane freeways in China. The data was created by Feng Tang using image recognition to extract 48 vehicle motion parameters and surrogate safety measures, with conflict labels at 30-second intervals. It was last updated on April 29, 2026, and is shared under a CC-BY-4.0 license.
Twelve interchange diverging areas from two multilane freeways in China were analyzed using image recognition to extract vehicle motion parameters. The dataset contains longitudinal and lateral conflict labels at 30-second intervals, used to develop and compare four machine learning models. It was created by Feng Tang and last updated on 2026-04-29.
48 vehicle motion parameters and surrogate safety measures were extracted from 12 interchange diverging areas on two multilane freeways in China. The dataset, created by Feng Tang, contains longitudinal and lateral conflict labels at 30-second intervals based on Extended Time-to-Collision (ETTC). It was last updated on 2026-04-29.
Twelve interchange diverging areas from two multilane freeways in China were analyzed using image recognition to extract 48 vehicle motion parameters and surrogate safety measures. The dataset, created by Feng Tang and last updated in April 2026, applies Extended Time-to-Collision (ETTC) to label longitudinal and lateral conflicts at 30-second intervals. It is a 9.5 KB Excel file released under a CC-BY-4.0 license.
A dataset of vehicle motion parameters and surrogate safety measures (SSMs) extracted from 12 interchange diverging areas on two multilane freeways in China. The data includes 48 vehicle motion parameters and conflict labels at 30-second intervals, created by Feng Tang and last updated in April 2026. It is stored in an XLS file and is 9.5 KB in size.
A 9.5 KB XLS file covering the replication of significant local genetic correlations between systemic sclerosis (SSc) and different cancers. The dataset was authored by Karina Patasova and last updated on 2026-05-27. It is shared under a CC-BY-4.0 license on the figshare platform.
Karina Patasova published a dataset of significant local genetic correlations between systemic sclerosis (SSc) and different cancers. The dataset is stored in an XLS file sized 9.5 KB and was last updated on 2026-05-27. It is shared under a CC-BY-4.0 license on the figshare platform.
Colombian police publications from the National Police's Revista Logos Ciencia & Tecnología and books published by the National Directorate of Schools. The dataset includes records from 2019 onward, as published by the Colombian National Police. It contains metadata on titles, authors, publication dates, and associated geographic locations.
A 23.3 KB supplementary table by Joel Sanchez Mendez, last updated on 2026-06-02. It contains the weights and coordinates for single nucleotide polymorphisms used to estimate a polygenic risk score for colorectal cancer. The data is shared under a CC-BY-4.0 license on figshare.
12.1 KB Excel file containing pathway annotations for single nucleotide polymorphisms overrepresented in PANTHER pathways. The supplementary table was created by Joel Sanchez Mendez and last updated on June 2, 2026. It supports a novel pathway-based polygenic risk score method investigating red meat intake and colorectal cancer risk.
14.6 KB of supplementary data from a research article on colorectal cancer risk. The table, published by Joel Sanchez Mendez on figshare, presents associations between red and processed meat intake and cancer risk, stratified by quartiles of pathway-based polygenic risk scores. It was last updated on June 2, 2026.
A supplementary table presenting confounding analyses for the association between the interaction of a TGF-β pathway-based polygenic risk score and red meat intake in relation to colorectal cancer risk. The 12.2 KB Excel file was authored by Joel Sanchez Mendez and last updated on June 2, 2026. It is shared under a CC-BY-4.0 license on figshare.
Snow depth, snow water equivalent (SWE), snow wetness, and snow pit data collected from two pine sites and a small clearing at the Local Scale Observation Site (LSOS) in northern Colorado. The dataset was created by the National Aeronautics and Space Administration as part of the Cold Land Processes Field Experiment (CLPX). Data collection concluded in March 2003, though metadata records show a later administrative update.