Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
164,782 datasets
Canada.ca analytics provides monthly usage data for the Government of Canada's primary website. The dataset covers the past 36 months and is published by Employment and Social Development Canada. It is available as an unaltered CSV file under the OGL-CA-2.0 license.
From 18 March 2011 to 18 April 2026, temperature loggers collected this time-series data around Rebe Reef. The dataset was aggregated by the Australian Ocean Data Network and last updated on 4 June 2026. It likely contains continuous temperature readings for monitoring reef environments.
460 NSCLC patient records from two hospitals, with an external validation cohort of 50 patients. The data includes preoperative neutrophil-to-lymphocyte ratio (NLR), platelet-to-lymphocyte ratio (PLR), systemic immune-inflammation index (SII), and systemic inflammation response index (SIRI) values, along with survival outcomes. This retrospective study by Xinying Cai, last updated in April 2026, used logistic regression and machine learning techniques to assess prognostic value.
United Kingdom data details provisional awards from the 30th Petroleum Licensing Round, focusing on the Southern North Sea (SNS). The dataset is published by the Government Digital Service and appears on both UK and EU open data platforms, indicating its official status. It likely contains geospatial information on awarded blocks or areas for hydrocarbon exploration.
Geodatabase accompanies the Midland Valley Shale report published by the British Geological Survey and the North Sea Transition Authority. It contains geospatial data layers relevant to shale geology and resource exploration in the Midland Valley region. The dataset is hosted by the UK Government Digital Service and is accessible via an ArcGIS GeoServices REST API.
32nd Round Provisional Award Map (NNS) is a geospatial dataset detailing the provisional awards for the UK's 32nd Offshore Petroleum Licensing Round in the Northern North Sea. The map, associated with the North Sea Transition Authority (NSTA), provides a quadrant-based view of areas under consideration for oil and gas exploration licenses. Its cross-platform presence on UK and EU open data portals signals its regulatory importance.
Visibility-reducing particle data tracks days with visual distance less than 20km across Queensland regions from at least 2000 to 2019. The dataset shows a downward trend over two decades but highlights specific years with more than 10 reduced visibility days due to dry conditions and widespread bushfires. It is provided by the Queensland Department of Environment, Science and Innovation under a CC-BY-4.0 license.
Australian Ocean Data Network collected temperature logger data around Bedarra Island. The dataset covers a time range from July 7, 2016, to May 19, 2026. It was last updated on June 4, 2026.
Population and mortality case numbers for children under five due to malnutrition in the Antioquia department of Colombia from 2005 to 2024. The data is updated annually with validated and closed figures from the previous year, sourced from www.datos.gov.co. It enables the calculation of annual mortality rates for public health analysis.
From November 19, 2009, to August 13, 2017, salinity data was collected at Orpheus Island. The data was gathered by the Great Barrier Reef Wireless Sensor Network, part of the Australian Integrated Marine Observing System's Great Barrier Reef Ocean Observing System project. It is managed by the Australian Ocean Data Network.
PaperBite Assets provides structured analysis notes and visual assets for AI/ML research papers. The dataset includes approximately 40 MB of Markdown analysis notes and indexes, plus about 1.8 GB of figures, tables, and rendered visuals. It was created by RipeMangoBox and last updated on June 7, 2026.
Wireless Sensor Networks Facility data from the Great Barrier Reef Ocean Observing System project. This hail dataset was collected by the Great Barrier Reef Wireless Sensor Network, part of the Australian Integrated Marine Observing System. The data is managed by the Australian Ocean Data Network and was last updated on 2026-06-04.
Spatial Services maintains a dynamic map of administrative and property boundaries for New South Wales. The web service includes polygon data for Counties, Suburbs, Parishes, Local Government Areas, State Forests, National Parks, and State Electoral Districts. Last updated on 2026-05-13.
Sheel Chandra's 172.4 KB Excel file provides scaled mutation rate estimates for different CpG sequence contexts. The data is organized into separate worksheets for each species and model, reporting context, methylation status, rate estimates, and standard errors. It was last updated on June 1, 2026, and is shared under a CC-BY-4.0 license.
9,000 AC sweep frequency traces simulate 9 distinct circuit states, including normal operation and 8 fault modes, for a Sallen-Key low-pass filter. The dataset incorporates 5% and 10% manufacturing tolerances and provides both raw and preprocessed data with added noise. Author Jianjun Zhong published this benchmark on figshare in April 2026 under a CC-BY-4.0 license.
An Origin project file containing the SHAP value analysis for a Random Forest model. The file presents feature contributions of input variables to predictions of displacement and ultimate load, along with feature importance comparisons. It was authored by Hongtao Zhang and last updated on June 1, 2026.
An origin project file containing a correlation coefficient matrix for variables in a concrete-filled steel tube study. The 70.5 KB file shows relationships among outer steel tube size, tube thickness, member length, steel yield strength, concrete strength, ultimate load, and displacement. It was authored by Hongtao Zhang and last updated on June 1, 2026.
An origin project file containing evaluation results for load prediction models. The data compares the performance of KNN, Decision Tree, XGBoost, and Random Forest models using R², RMSE, MAE, and MSE metrics. The 201.3 KB file was authored by Hongtao Zhang and last updated on June 1, 2026.
Xiao Liang's dataset quantifies gene distribution patterns across 31 non-human primates and 4 non-primate species. It defines a 'primate specific ratio' based on gene set counts identified across all primates, subsets of primates, and absent in non-primates. The dataset was last updated on May 11, 2026, and is shared under a CC-BY-4.0 license.
Experimental data from a study on multi-service scheduling for intelligent manufacturing platforms. The 133.3 KB CSV file supports research on scheduling algorithms considering personalized customization and data security. Authors are listed for anonymous review purposes, and the dataset was last updated on May 27, 2026.