Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
149,839 datasets
Visibility-reducing particle data tracks days with visual distance less than 20km across Queensland regions from at least 2000 to 2019. The dataset shows a downward trend over two decades but highlights specific years with more than 10 reduced visibility days due to dry conditions and widespread bushfires. It is provided by the Queensland Department of Environment, Science and Innovation under a CC-BY-4.0 license.
Population and mortality case numbers for children under five due to malnutrition in the Antioquia department of Colombia from 2005 to 2024. The data is updated annually with validated and closed figures from the previous year, sourced from www.datos.gov.co. It enables the calculation of annual mortality rates for public health analysis.
Australian Ocean Data Network collected temperature logger data around Bedarra Island. The dataset covers a time range from July 7, 2016, to May 19, 2026. It was last updated on June 4, 2026.
From November 19, 2009, to August 13, 2017, salinity data was collected at Orpheus Island. The data was gathered by the Great Barrier Reef Wireless Sensor Network, part of the Australian Integrated Marine Observing System's Great Barrier Reef Ocean Observing System project. It is managed by the Australian Ocean Data Network.
Wireless Sensor Networks Facility data from the Great Barrier Reef Ocean Observing System project. This hail dataset was collected by the Great Barrier Reef Wireless Sensor Network, part of the Australian Integrated Marine Observing System. The data is managed by the Australian Ocean Data Network and was last updated on 2026-06-04.
Spatial Services maintains a dynamic map of administrative and property boundaries for New South Wales. The web service includes polygon data for Counties, Suburbs, Parishes, Local Government Areas, State Forests, National Parks, and State Electoral Districts. Last updated on 2026-05-13.
Sheel Chandra's 172.4 KB Excel file provides scaled mutation rate estimates for different CpG sequence contexts. The data is organized into separate worksheets for each species and model, reporting context, methylation status, rate estimates, and standard errors. It was last updated on June 1, 2026, and is shared under a CC-BY-4.0 license.
9,000 AC sweep frequency traces simulate 9 distinct circuit states, including normal operation and 8 fault modes, for a Sallen-Key low-pass filter. The dataset incorporates 5% and 10% manufacturing tolerances and provides both raw and preprocessed data with added noise. Author Jianjun Zhong published this benchmark on figshare in April 2026 under a CC-BY-4.0 license.
NASA's Sentinel-6A Michael Freilich spacecraft provides reprocessed Level 2 high-resolution non-time-critical altimetry data from its Poseidon-4 SAR instrument. This dataset contains sea surface height, sea surface height anomalies, and significant wave height, along with 1 Hz and 20 Hz Ku-band measurements and corrections from the AMR-C instrument. The product is analogous to the Jason-3 GDR standard and is available in both a standard release with 1 Hz and 20 Hz data and a reduced release with only 1 Hz observations.
An Origin project file containing the SHAP value analysis for a Random Forest model. The file presents feature contributions of input variables to predictions of displacement and ultimate load, along with feature importance comparisons. It was authored by Hongtao Zhang and last updated on June 1, 2026.
An origin project file containing a correlation coefficient matrix for variables in a concrete-filled steel tube study. The 70.5 KB file shows relationships among outer steel tube size, tube thickness, member length, steel yield strength, concrete strength, ultimate load, and displacement. It was authored by Hongtao Zhang and last updated on June 1, 2026.
An origin project file containing evaluation results for load prediction models. The data compares the performance of KNN, Decision Tree, XGBoost, and Random Forest models using R², RMSE, MAE, and MSE metrics. The 201.3 KB file was authored by Hongtao Zhang and last updated on June 1, 2026.
Xiao Liang's dataset quantifies gene distribution patterns across 31 non-human primates and 4 non-primate species. It defines a 'primate specific ratio' based on gene set counts identified across all primates, subsets of primates, and absent in non-primates. The dataset was last updated on May 11, 2026, and is shared under a CC-BY-4.0 license.
Experimental data from a study on multi-service scheduling for intelligent manufacturing platforms. The 133.3 KB CSV file supports research on scheduling algorithms considering personalized customization and data security. Authors are listed for anonymous review purposes, and the dataset was last updated on May 27, 2026.
A wastewater sampling campaign in November 2015 measured acute lethality and physicochemical quality of untreated water discharged into a river. The work was conducted on the south-east interceptor between 11 and 14 November 2015. Government and Municipalities of Québec published the data under a CC-BY-4.0 license.
A historic record of Land Act Crown grants issued between April 1, 2000 and March 31, 2009. Crown grants are instruments used to permanently convey Crown land under fee simple title, including direct sales, sponsored grants, lease to purchase, and land exchanges. This data was sourced from TANTALIS and used to inform the Ministry of Forests, Lands and Natural Resource Operations Crown Land Indicators and Statistics report.
19-131 Reef sea water temperature data collected by deployed loggers from 12 March 2011 to 17 April 2026. The dataset is provided by the Australian Ocean Data Network and was last updated on 4 June 2026.
Australian Ocean Data Network collected sea water temperature data from loggers deployed at 19-138 Reef. The time series spans from 14 March 2011 to 21 April 2026. The dataset was last updated on 4 June 2026.
A research paper comparing multilayer perceptron neural networks to logistic regression for optimizing steel structure design. The study evaluates different ANN configurations to find effective arrangements for reducing weight, costs, and environmental impact. The work was authored by Amirhossein Ostovar and published on figshare under a CC-BY-4.0 license.
Puntos de Conectividad Itagüí Inteligente Digital maps the exact locations of free community Wi-Fi access points in the Colombian municipality of Itagüí. The dataset is published by the Colombian open data portal www.datos.gov.co and was last updated on May 18, 2026. It likely contains point coordinates and descriptive information for each connectivity hub.