General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
169,570 datasets
An overview comparing the economic paradigms of green growth and degrowth. The dataset was authored by Dallas O’Dell and last updated on May 28, 2026. It is a small 17.5 KB Excel file available under a CC-BY-4.0 license.
A comparison with prior work for short-term prediction of acute exacerbations of chronic obstructive pulmonary disease (AECOPD). The dataset is a 9.5 KB Excel file authored by Florian Tilquin and shared under a CC-BY-4.0 license on figshare.
A 5.5 KB Excel file uploaded by Florian Tilquin on May 28, 2026. It contains performance metrics for the BVS³ algorithm and individual vital-signs Z-scores for the early detection of Acute Exacerbation of Chronic Obstructive Pulmonary Disease (AECOPD) events. The dataset is shared under a CC-BY-4.0 license on figshare.
A curated collection of traditional and modern Japanese haiku masterworks. The dataset includes authors, seasonal classifications, specific seasonal keywords, and detailed contextual explanations in both English and Japanese. It was created by shigr3 and last updated on June 8, 2026.
Stage 3 complaints in 2014 required coordinated corrective action overseen by the Chief Executive. This log details the final escalation stage for all complaints that year, providing a record of required resolutions. The dataset offers insight into the outcomes of the most serious public grievances handled by the council.
UK Government Digital Service annual statements of accounts detail its financial activities and overall position to meet statutory requirements and proper accounting practice. The dataset covers a four-year period from 2012 to 2015, providing a technical record of government financial operations. Its presence on multiple open data platforms indicates it is a formal, recurring public record.
Lincolnshire County Council's annual transparency data details its counter fraud activities, including fraud referrals and investigations. The dataset is published under the UK Local Government Transparency Code and updated each June. Figures for fraud and irregularities are likely identical, as they are not categorized differently in practice.
Graduate statistics from INTEP's academic programs across technical, technological, and professional cycles. The data includes details on formation level, semester, municipality, period, academic unit, program, year, student type, and modality. It is hosted on the Colombian open data portal, datos.gov.co, and was last updated on May 21, 2026.
Mapeo de Entidades Públicas (Mapping of Public Entities) was created by the Open Data group of the Colombian Ministry of Information and Communications Technologies (MinTIC). The dataset includes columns for entity names, codes, addresses, and geographic coordinates for municipalities and entities. It was last updated on 2026-05-26 17:49:48.
17.9 MB of data and R code accompany a study on rapid decreases in Northeast Pacific seamount foundation species. The dataset, authored by Lindsay Clark and last updated in April 2026, is available under a CC-BY-4.0 license and includes CSV and XLSX files.
Xingyu Zhang's dataset, published on figshare in April 2026, provides simulation data for understanding chemical degradation in Nafion membranes used in fuel cells. The 2.6 KB CSV file contains data used to train interpretable machine learning and deep learning models for predicting proton transport properties. The models leverage a high-fidelity, multiscale simulation dataset to link nanostructure, hydration, temperature, and degradation.
Floridablanca, Colombia traffic violation reports for the year 2020. The dataset includes columns such as INFRACCION (infraction), FECHA (date), and PLACA (license plate). It is hosted on the datos.gov.co platform via Socrata and was last updated on 2026-05-18.
A database containing the number of employees for companies registered with the Chamber of Commerce of Magdalena Medio and Northeast Antioquia. The dataset includes columns for company name (RAZON SOCIAL), tax ID (NIT), employee count (PERSONAL), and municipality of commercial activity (MUN-COMERCIAL). It was published on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
Montgomery County, Maryland's Alcohol Beverage Services Breakage Inventory dataset logs instances of broken or wasted inventory. The log is updated monthly and includes columns for Item, Quantity, Unit Of Measure, Item description, and Date. This data provides a record of inventory loss for government-managed alcohol sales.
Kinematics and balance performance data from a study of healthy younger adults under different neck bracing conditions. The dataset, authored by Emily Eichenlaub, is available on figshare under a CC-BY-4.0 license and was last updated on 2026-05-28. The file is 27.9 KB in size.
The 2001 third edition of the Directory of Important Wetlands of Australia provides the source listing for this geospatial dataset. It contains boundaries for wetlands in Victoria, Australia, derived from the 1994 wetland layer, hydrological data, and topographic maps, with updates in 2017 improving boundary accuracy. The dataset is published by the Department of Energy, Environment and Climate Action under a CC-BY-4.0 license.
Supplementary Material 4 accompanies the research article 'S-nitrosylation of Annexin A2 at Cys133 ameliorates pulmonary arterial hypertension by inhibiting the WNT/β-catenin pathway'. The dataset is published on figshare by author Yu Wande under a CC-BY-4.0 license. It is a 547.2 KB CSV file, last updated on 2026-06-02.
Corpoboyaca, Colombia, maintains this historical record of atmospheric emission permits. The dataset includes columns such as Tipo de Emision (Emission Type), Municipio (Municipality), Solicitante (Applicant), and coordinates (X, Y). It was last updated on 2026-05-25 and is provided by the Colombian open data portal www.datos.gov.co.
A chemical dataset from figshare, authored by Zerong Guo and last updated on 2026-05-08. It contains data related to a dearomative multifunctionalization strategy for converting planar isoquinoliniums into three-dimensional bridged bis-tetrahydroisoquinolines. The dataset includes physicochemical property evaluations and PMI analysis to confirm three-dimensionality and drug-like profiles.
Urban density data from the Bundesamt für Kartographie und Geodäsie measures the intensity of building use in Berlin's planning spaces (LOR). The dataset provides the GFZ (Geschossflächenzahl, or floor area ratio), the ratio of total floor area to plot area, as of November 2009. It is served as a WMS (Web Map Service) layer.