Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
165,331 datasets
Northern Yukon's Wernecke Mountains expose the Proterozoic Pinguicula Group, a succession of clastic and carbonate rocks. The strata were deposited after the Racklan orogeny and Hart River sill emplacement, with contact relationships clarified during a 2009 field season. The Government of Yukon published this geological study, raising questions about the group's age and correlation with the Fifteenmile Group.
The glacial history and placer gold potential dataset from the Government of Yukon provides reconstructions and geomorphic mapping for the North McQuesten River, Dublin Gulch, and Keno Hill map areas. It details a succession of glaciations, including pre-Reid, Reid, and McConnell episodes, and analyzes placer potential based on geomorphology, glacial history, geochemistry, bedrock geology, and historic records. The dataset was last updated on April 17, 2026.
MTA Bridges & Tunnels safety indicators track preventative measures and incident occurrences. The data is provided by data.ny.gov and includes monthly measurements for various metrics against targets. The dataset was last updated in May 2026.
Scherm On-Premise LLM Inference Benchmark v0.5.1 provides performance data for large language models across 9 real GPUs, from the NVIDIA B200 to older consumer cards like the GTX 1080 Ti. The benchmark includes metrics like throughput (tok/s), VRAM usage, and tensor-parallel scaling, measured with a methodology using a seed of 1234, 10 repetitions per point, and input/output lengths of 512 and 256 tokens. It was created by Scherm-AI and last updated on 2026-06-16.
LeRobot was used to create this dataset, which likely contains teleoperation records for a robotic arm. The dataset features include action data with six joint positions for a manipulator. It was authored by ohdoking and uploaded to Hugging Face on June 20, 2026.
1.24-meter spatial resolution multispectral imagery was collected by the WorldView-4 satellite across the global land surface from December 2016 to January 2019. The data contains four spectral bands—blue, green, red, and near-infrared—and is provided in NITF and GeoTIFF formats as sensor-corrected Level 1B products. Its high temporal resolution of approximately 1.1 days supports detailed monitoring of land surface changes.
A comparison of shared and co-aperture antenna designs created by Abdul Rehman Chishti and published in 2026. The dataset is 5.5 KB in size and focuses on application, size, and gain parameters. It is available under a CC-BY-4.0 license.
9.5 KB of computed hemodynamic parameters under four stenosis severities (30%, 50%, 70%, and 90%) and three blood viscosity conditions (below-normal, normal, and high). The dataset was authored by Lei Zhengyao and last updated on 2026-05-28.
17.4 KB of data from a mixed-effects analysis investigating the link between pre-competition strength metrics and sprint canoe/kayak performance. The dataset was authored by Zongwei Chen and last updated on May 28, 2026. It likely contains measurements from professional Chinese athletes.
17.4 KB of data from a mixed-effects analysis investigating the link between pre-competition strength metrics and sprint canoe/kayak performance. The dataset was authored by Zongwei Chen and last updated on May 28, 2026. It likely contains measurements from professional Chinese athletes.
S1 File contains data for the study 'Association between pre-competition strength and sprint canoe/kayak performance: A mixed-effects analysis of professional Chinese athletes'. The dataset is 33.3 KB in size, stored as an XLSX file, and was authored by Zongwei Chen. It was last updated on 2026-05-28.
Property-related charge information by period, sourced from data.cityofnewyork.us. The dataset includes columns such as TAXYEAR, SUM_BAL, VALCLASS, and PARID, but a technical issue means 2023 data is missing and cannot be recreated from this snapshot. It was last updated on 2026-05-21.
Part16–part19 of the WildGUI dataset contain screenshot images, extending the main release at xwm/WildGUI. The dataset was introduced by Video2GUI and is hosted by author joker-112. The repository was last updated on 2026-06-14.
Replication data and code for a study analyzing the impact of natural disasters on corporate performance in China. The dataset, approximately 864 MB in size, likely contains firm-level financial and operational metrics linked to disaster events. It supports research into how environmental shocks affect business outcomes such as profitability and innovation.
Almost 10 times the number of light commercial vehicles were on-road in Queensland compared with heavy freight vehicles as of 30 June 2019. The number of registered light commercial vehicles more than doubled since 30 June 2001, while heavy freight vehicles increased by 49% in the same period. This dataset is provided by the Queensland Department of Environment, Tourism, Science and Innovation.
Australia's Southeast Marine Region dataset from the Australian Ocean Data Network provides 3D images and descriptive text about the marine environment. The dataset was last updated on 2026-06-17. It is available in HTML and PDF formats.
Supplementary material 4 from a study on decadal seafloor geodesy along the Nankai Trough. The dataset contains the average of standard deviations for coefficients used in estimating slip deficit rates for two directions, labeled "02" and "03". It was authored by Yusuke Yokota and is shared under a CC-BY-4.0 license.
Australian Ocean Data Network provides a record of gravity and magnetic data sources covering the remote offshore Capel and Faust basins on the Lord Howe Rise. The documentation describes the processes applied to level the collected geophysical data. This dataset was last updated on 2026-06-17.
57% of 112 surveyed German healthcare professionals treating cardiology patients reported using telemedicine. This dataset contains predictors of telemedicine use identified via Bayesian Model Averaging and an XGBoost model achieving 0.88 AUROC, created by Pascal Petit and last updated in April 2026. It likely includes variables related to professional role, knowledge, attitudes, and demographics.
112 healthcare professionals from a German cross-sectional survey provide data on telemedicine use determinants. The dataset contains the performance metrics and predictor importance results from a final XGBoost model developed by Pascal Petit, last updated in April 2026. The model achieved an AUROC of 0.88 and 79% accuracy in predicting telemedicine adoption.