General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
142,312 datasets
Certified Payroll Registration records weekly payroll filings for contractors and subcontractors on New York public works projects under Article 8. The dataset contains project details, employee hours, wage rates, and job classifications for laborers, mechanics, and building service employees. Data is structured to track compliance and payment across all tiers of subcontractors on covered projects.
Jean-Pierre Dujardin's dataset contains reclassification accuracy scores for biological landmarks after different alignment methods. The data is stored in a 9.5 KB Excel file and was last updated on June 2, 2026. It includes scores for landmarks from multiple species, such as sawadwongporni, pseudowillmori, and gambiensis.
A small dataset (9.5 KB) contains reclassification accuracy scores for biological specimens after different landmark alignment procedures. The data was authored by Jean-Pierre Dujardin and last updated on June 2, 2026. Scores are presented for least-square alignment, resistant-fit alignment, and random search, with adjustments for chance.
Information on assistance provided to citizens abroad by the Ministry of Foreign Affairs. The data includes assistance types, consulates providing the service, and a basic profile of the assisted population. It is published by www.datos.gov.co and was last updated on 2026-05-18. The columns suggest that each case carries demographic, geographic, and categorical details.
A lookup table for England linking Lower Layer Super Output Areas (LSOA) to NHS administrative boundaries as of 1 April 2026. The dataset includes fields for LSOA codes and names, Sub Integrated Care Board Locations (SICBL), Integrated Care Boards (ICB), NHS England Regions (NHSER), and local authority districts (LAD). It was published by the Office for National Statistics and last updated on 15 April 2026.
160 adolescents aged 11–14 played an HIV prevention game, generating 240 gameplay-derived behavioral metrics. Kammarauche Aneni analyzed this data in a proof-of-concept study to predict lifetime substance use and drug-refusal self-efficacy using machine learning models. The dataset, last updated in 2026, contains the results of this analysis.
Experimental data from a study on the late-stage renal effects of nonylphenol exposure and the potential nephroprotective role of crocin in Wistar rats. The dataset includes body and kidney weights, histopathological scores, glomerular morphometry, and biochemical markers for renal function and oxidative stress. It was authored by Esra Babaoğlu and last updated on 2026-06-02.
Mohammad Kazemi's dataset provides a comparative table of nine feature selection methods applied to flood susceptibility mapping. The data, last updated in April 2026, originates from a study using 19 environmental factors and 1,000 sample points from flood-prone Khuzestan Province, Iran. It details the consensus selection of key predictors like NDVI and daily minimum temperature, and the performance of metaheuristic-optimized LSTM models.
19 environmental factors were sourced from Google Earth Engine for flood susceptibility mapping in Khuzestan Province, Iran. The dataset contains 1,000 sample points (500 flood and 500 non-flood) used to train a model, with results generalized to the entire study area. Author Mohammad Kazemi published the data on figshare in 2026 under a CC-BY-4.0 license.
A 5.5 KB Excel table compares nine feature selection methods applied to flood susceptibility mapping. The dataset was created by Mohammad Kazemi and last updated on 2026-04-29. It is based on 19 initial environmental factors and 1,000 sample points from flood-prone Khuzestan Province, Iran.
Judit del Río published final full-vector paleomagnetic results for five kilns on figshare in 2026. The dataset includes directional and intensity measurements for specimens, with corrections for anisotropy and cooling rate effects. It is a small dataset, 5.5 KB in size, stored in an XLS file.
NASA CDDIS provides daily files of GLONASS combined broadcast ephemeris data from a global network of ground receivers. The data contains all distinct navigation messages received in one day, sampled at 30-second intervals and stored in the RINEX format. This dataset supports precise positioning, timing, and geodetic research by detailing satellite orbits and clock corrections.
This dataset provides hourly, per-pixel land surface temperature measurements for North and South America from 2000 to 2016 at a 4-kilometer spatial resolution. It is derived from data acquired by the Geostationary Operational Environmental Satellites (GOES) 8 and 10 through 15. The product includes variables for cloud mask, latitude, longitude, land surface temperature, and land surface temperature error.
97 studies published between 2015 and May 2025 were systematically reviewed to map the application of artificial intelligence in behavioral analysis of invertebrate and larval model organisms. The review, authored by Zuzanna Stępnicka and published on figshare, analyzes model organisms, AI methods, input data characteristics, preprocessing pipelines, model architectures, and evaluation metrics. It proposes a standardized reporting framework to enhance transparency and reproducibility in the field.
97 eligible studies published between 2015 and May 2025 were analyzed in this systematic review. The review maps the use of artificial intelligence methods, including deep learning and machine learning, for analyzing the behavior of organisms like Drosophila melanogaster and zebrafish larvae. It was authored by Zuzanna Stępnicka and published on figshare in May 2026.
Zuzanna Stępnicka's systematic review comprehensively maps the use of artificial intelligence in behavioral analysis of invertebrate and larval organisms. It analyzes 97 eligible studies published between 2015 and May 2025, covering model organisms, AI methods, input data characteristics, preprocessing pipelines, model architectures, and evaluation metrics. The review proposes a standardized reporting framework to enhance transparency and reproducibility.
97 eligible studies published between 2015 and May 2025 were analyzed. The dataset is a systematic review mapping the application of artificial intelligence methods in behavioral analysis of invertebrate and larval organisms. It was authored by Zuzanna Stępnicka and published on figshare.
Zuzanna Stępnicka's systematic review maps the use of artificial intelligence for behavioral analysis of invertebrate and larval model organisms. The review, published on figshare in May 2026, analyzes 97 eligible studies published between 2015 and May 2025. It examines model organisms, AI methods, input data characteristics, and evaluation metrics.
97 eligible studies published between 2015 and May 2025 were analyzed in this systematic review. The review maps the application of artificial intelligence methods, including machine learning and deep learning, for automated behavioral analysis of invertebrate and larval organisms like Drosophila melanogaster and Caenorhabditis elegans. It was authored by Zuzanna Stępnicka and published on figshare in May 2026.
A systematic review mapping the application of artificial intelligence in behavioral analysis for invertebrate and larval model organisms. The dataset, authored by Zuzanna Stępnicka and published on figshare in May 2026, analyzes 97 eligible studies published between 2015 and May 2025, tracking trends in model organisms, AI methods, and data characteristics.