Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
153,745 datasets
Primer sequences used for RT-qPCR validation of potential key genes. The dataset is a 22.5 KB Excel file authored by Haiming Liang and shared under a CC-BY-4.0 license. It was last updated on June 2, 2026.
Nagara Wakgari Futasa authored this dataset on TH softening technology. The dataset is a 13.5 KB Excel file available under a CC-BY-4.0 license. It was last updated on June 2, 2026.
Fit statistics for latent class models ranging from 2 to 6 classes, published by Shannon A. H. Compton. The dataset is a 5.5 KB Excel file hosted on figshare and last updated in May 2026.
Spatial layers from Ku-Ring-Gai Council detail flood characteristics for design floods ranging from a 20% Annual Exceedance Probability to the Probable Maximum Flood. The dataset is hosted on data.gov.au and was last updated in May 2026. It provides flood map outputs for the Middle Harbour Northern Catchments area.
User profiles for projects registered with the Rural Development Agency for the 2020 fiscal period. The data is used for review in constructing comprehensive agricultural and rural development projects. It originates from the Colombian open data portal, datos.gov.co, and was last updated on 2026-05-18.
Yotoco's 2020 Annual Acquisition Plan details planned government purchases, including estimated values and contract details. The dataset includes columns for estimated contract duration, selection modality, UNSPSC codes, and funding sources. Data is provided by the Colombian open data portal, www.datos.gov.co, and was last updated in May 2026.
Wikipedia PT Categories is a Portuguese clustering evaluation dataset containing 2,873 articles from pt.wikipedia.org, each labeled with one of 15 broad topic categories. The dataset was created by tardellirs and serves as the source for the WikipediaPTCategoriesClusteringP2P task in the MTEB(por) benchmark. It was last updated on 2026-06-08.
Colombia's ICFES institute maintains this registry of its information assets available to the public. The dataset lists categories of information, their formats, availability, and physical or digital locations. It was last updated on 2026-05-18.
2022 data from Lac-Saint-Jean and Saint-Maurice River sectors delineates flooded areas exceeding established cartographic flood zones. Photogrammetric capture from aerial photographs was used to map the farthest water limits reached during flooding events. The dataset supports the Plan for the protection of the territory against floods (PPTFI).
Over 1,300 convents and monasteries in the geographical area affected by the German Peasants' War (1524-1526) are listed with coordinates and information on the war's effects. The dataset was provided by the 'Visualising the Destruction of Convents and Monasteries in the German Peasants' War' project team at Oxford and Royal Holloway. It is available for download in XLSX format.
From 2006 to April 2026, this database contains all competency conflicts presented to the Constitutional Court of Colombia. The data was last updated on May 4, 2026, and is provided by the platform www.datos.gov.co. It includes columns for case file number, subject matter, date, and case type.
United Nations Security Council decisions from 1999 onward containing keywords related to the Protection of Civilians. The Security Council Affairs Division created this dashboard as an information resource for the Repertoire of the Practice of the Security Council. The data was last updated on 2026-05-20.
Academic program information offered by the Center for Aeronautical Studies of Aerocivil for continuing education. The dataset includes columns for activity name, cost, modality, duration, target audience, objectives, and study plan. It is hosted on the Colombian open data portal www.datos.gov.co and was last updated on May 18, 2026.
A collection of video files with action annotations documenting the initial stage of basket trap construction. The dataset, created by Marie-Annick Moreau, includes footage from carving sticks to tying them onto the top ring. It was last updated on June 3, 2026, and is shared under a CC-BY-NC-SA 4.0 license.
Infrastructure Australia created this geospatial dataset for the 2019 Australian Infrastructure Audit. It represents average weekday transport crowding performance during the PM peak period from 4pm to 6pm in 2016. The data models strategic transport conditions, excluding network links below daily volume thresholds.
A list of the top 20 pain products sold by a retailer, which collectively accounted for 53% of total menstrual product sales. The data covers sales between 30th April 2006 and 16th April 2015. It was authored by Victoria Sivill and published on figshare under a CC-BY-4.0 license.
Standard error of estimate (σ_est) for predictions made by the DAMM model regarding fecal short-chain fatty acid chemical oxygen demand. The 5.5 KB XLS file, authored by Taylor L. Davis and last updated in May 2026, quantifies the error for predictions against an identity line where predictions should equal measurements.
Matthew N. Ponticiello's dataset records changes in interest, perceived difficulty in accessing, and perceived importance of initiating medications for opioid use disorder before and after a brief intervention. The data covers 117 participants on probation with opioid use disorder. It was last updated on 2026-05-27 and is shared under a CC-BY-4.0 license.
A dataset supporting a machine learning model for engineering porous biochar for CO2 adsorption. The gradient boosting regression model uses biomass composition, pyrolysis, activation, and adsorption conditions as inputs, achieving an R² of 0.99 and RMSE of 0.15. The dataset, created by Chengkai Cao and last updated in May 2026, is provided in an XLSX file.
City of Hobart drainage catchment data includes polygon geometries for each catchment area. Associated database entries provide the catchment name and its calculated area. The dataset is maintained by the Hobart City Council's GIS team for environmental and stormwater management purposes.