Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
169,350 datasets
Country-Based Pooled Funds (CBPFs) data contains approved project allocations and contributions received for Myanmar. The dataset is provided by the Country-Based Pooled Funds (OCHA) organization. It was last updated on 2026-05-19.
Ethiopia's Country-Based Pooled Funds (CBPFs) approved project allocations and received contributions. The dataset is provided by the Country-Based Pooled Funds (OCHA) and was last updated on 2026-05-19. It is available in CSV format under a CC-BY-4.0 license.
Approved project allocations and contributions for Country-Based Pooled Funds (CBPFs) in the Democratic Republic of the Congo (DRC). The dataset is provided by the United Nations Office for the Coordination of Humanitarian Affairs (OCHA) and was last updated on 2026-05-19.
Country-Based Pooled Funds (CBPFs) data contains approved project allocations and contributions received for Afghanistan. The dataset is published by the United Nations Office for the Coordination of Humanitarian Affairs (OCHA). It was last updated on 2026-05-19.
Approved project allocations from Country-Based Pooled Funds (CBPFs) and the contributions received by each fund for Somalia. The dataset is published by the Country-Based Pooled Funds (OCHA) and was last updated on 2026-05-19.
Country-Based Pooled Funds (CBPF) allocations and contributions for South Sudan, managed by OCHA. The dataset contains approved project allocations and the contributions received by each fund. It was last updated on 2026-05-19.
Country-Based Pooled Funds (CBPFs) approved project allocations and contributions for the Central African Republic. The dataset is published by OCHA's Country-Based Pooled Funds and was last updated on 2026-05-19. It is available under a CC-BY-4.0 license.
Country-Based Pooled Funds (CBPF) allocations and contributions for Sudan, managed by OCHA. The dataset contains approved project allocations and the contributions received by each fund. It was last updated on 2026-05-19.
Four black spruce forest sites burned in 1930, 1964, 1981, and 1989 provide a rare temporal sequence for studying boreal wildfire recovery. Paired portable eddy flux systems collected 4-6 weeks of peak-season data at each site in 1999 or 2000, measuring carbon, water, and energy exchanges. This data is part of a larger age-sequence study aiming for year-round measurements across seven sites spanning 2 to 150 years post-fire.
A tabular file contains information on known Harvard repositories on GitHub. It includes metrics such as the number of stars, programming language, day last updated, number of open issues, size, number of forks, repository URL, create date, and description. The dataset was created by Philip Durbin and last updated on June 25, 2026.
A dataset from datos.gov.co last updated on 2026-05-18. It compiles information on institutional and business actors driving economic development in the Risaralda department of Colombia. The data includes columns such as Estado Rnt, Municipio, Razon Social, and Descripcion Categoria.
95 GLMM files for Crown-of-Thorns Starfish orientation behavior, excluding third-and-later repeated trials. The dataset includes variables for individual ID, reproductive phase, seawater inflow rate, surface temperature, movement result, coral species count, coral mass, and sex. It was authored by Masumi Kamata and last updated on May 5, 2026.
European legislatures from 1980 to 2021 provide the scope for this country-year panel dataset. It contains replication materials for an article on party institutionalisation, party-system closure, polarisation, and recorded physical violence. The package includes analytic data, Python code for reproducing descriptive tables, PPML models, robustness checks, and figure source data.
Source data files for all figures in a scientific manuscript on the Zwan-Wolf effect. The 9.7 MB ZIP archive contains individual ASCII files, each corresponding to a specific figure, and includes a README for context. Author Christopher Fowler published the dataset on figshare under a CC-BY-4.0 license in May 2026.
Laurentian Great Lakes daily gridded ice concentration data from the NOAA Great Lakes Environmental Research Laboratory (GLERL). This dataset provides consistent daily records starting in 1973, supporting long-term climate and hydrological studies. Data are available in multiple formats including ASCII text files, shapefiles, and browse images.
A replication database for the paper 'Disconnection and Disorder: Internet Shutdowns, Protest Violence, and Infrastructural Coercion in the Global South'. It contains original merged datasets and scripts. The data was authored by ZHANG, YANG and hosted on Harvard Dataverse, last updated on 2026-06-26.
Dataverse at UCLA hosts datasets and scripts for the study 'From Satellite Observations to Machine Learning: Predicting Plume Injection Fraction and Its Impacts on Smoke Pollution Across the Western United States'. The collection includes GEOS-Chem model simulations, GFED fire emissions, MISR plume height data, EPA and IMPROVE air quality observations, and random forest models for predicting daily plume injection fractions. The data was last updated on June 8, 2026.
Eastleigh Borough Council issues Article 4 Directions, which are legal instruments for specific development control. The dataset is provided as a Web Map Service (WMS) and was last updated on 2026-06-19. It originates from the uk_data platform.
Yearly timesteps from 2020 to 2050 project land-use outcomes for a hybrid sustainable future scenario. The data was modeled using the CLUMondo (v5.1) land-system model and is provided at a 1km resolution for the broader European region. The scenario was authored by Venier Cambron, Camille and represents a combination of environmental value translations from the Natures Futures Framework.
Over 2800 historical ice charts were digitized to create this geospatial data base of ice concentration grids for the Great Lakes. The dataset was provided by the NOAA Great Lakes Environmental Research Laboratory (GLERL) to the National Snow and Ice Data Center (NSIDC). It captures a continuous 20-year record of winter ice conditions from 1960 through 1979.