Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
147,202 datasets
A tabular dataset from the Government of British Columbia reports the amount of roaded and roadless land area within each ecoregion of the province. The analysis used the provincial Digital Road Atlas as of May 1, 2018, defining roaded areas as those within 500 metres of a road and roadless areas as those beyond 500 metres. Areas are reported in hectares to three significant figures for British Columbia and its constituent ecoregions.
A historic record of all Land Act tenures in good standing on April 1, 2009, sourced from the TANTALIS system. Crown tenures include leases, licences, rights-of-way, permits, reserves, and notations issued under agreements between individuals or companies and the provincial government. This data was used to inform the Ministry of Forests, Lands and Natural Resource Operations Crown Land Indicators and Statistics report.
5.5 KB of data in XLS format, authored by Bradley Mason and last updated on 2026-05-29. The dataset likely contains parameters and constraints related to the μ, σ, α, and β distribution parameters used during the expectation-maximization algorithm.
Model output results before and after applying standard deviation scaling. The dataset, created by Bradley Mason and shared under a CC-BY-4.0 license, contains dimensional-independent standard deviation scaling results for synthetic and real monocyte gated clusters. It was last updated on May 29, 2026.
Anonymised GPS metrics for U16 and U18 inter-county Ladies Gaelic Football players, broken down by halves of play. The 50.9 KB XLSX file was authored by Teresa Molohan and last updated on 2026-05-29. Data is available under a CC-BY-4.0 license via figshare.
A document outlining six core principles guiding scientific activities at Geoscience Australia. It describes how the agency conducts science, embedding these principles into strategic planning and daily operations. The principles emphasize relevance, collaboration, quality, transparency, communication, and sustained capability.
A geospatial layer delimiting the perimeter of the 2017-2019 flooded area declared as a special intervention zone in Quebec. The Government and Municipalities of Québec created this zone using data sources from the 2017 and 2019 floods, and it was last updated on the platform in April 2026. The dataset is available under a CC-BY-4.0 license.
A list of council members (ediles and edilesas) for the municipality of Yopal in the Casanare department of Colombia. The dataset distinguishes between urban and rural representatives, with urban members linked to one of five communes and rural members linked to a corregimiento. The data is provided by the Colombian open data platform www.datos.gov.co and was last updated on May 18, 2026.
Supplementary material containing calculated values for stress accumulation rate distributions along the Nankai Trough subduction zone. The data was authored by Yusuke Yokota and is licensed under CC-BY-4.0. It was last updated on June 3, 2026.
A processed county-year panel dataset for reconstructing tertiary industry added value in Shanxi Province, China, from 2012 to 2022. It integrates multi-source remote sensing, geospatial, and socioeconomic information within a unified framework. The dataset was created by PENGFEI Jia and includes processed predictor tables, metadata, and reproducible code files.
Monthly sales tax collections for the 2% City of Baton Rouge tax, 2% Unincorporated East Baton Rouge Parish tax, and the combined City-Parish tax. Figures are broken down into regular collections managed by the City-Parish Revenue Division and vehicle tax collections managed by the State Department of Public Safety. The dataset includes separate totals for city, parish, and combined jurisdictions, as well as subtotals for vehicle and non-vehicle sales.
Yuhong Liu published parameter estimates and confidence intervals on figshare in June 2026. The dataset contains maximum likelihood estimates and 95% profile likelihood confidence interval bounds for all parameters across all models. It is a small dataset, 12.3 KB in size, shared under a CC-BY-4.0 license.
All pairwise post-hoc comparison results (FDR-corrected) for simulation outputs and geometric metrics. The dataset includes Cohen’s d effect sizes and 95% confidence intervals. It was authored by José Alonso Solís-Lemus and last updated on June 2, 2026.
Benchmark regression results from a study on agricultural digitalization and export quality in China. The dataset contains panel data from 34 provincial administrative regions in China spanning 2001 to 2024, compiled by author Zhichao Jiang. It was last updated on May 12, 2026, and is shared under a CC-BY-4.0 license.
93.7% overall accuracy is reported for a dynamic-static classification task. The dataset is a confusion matrix, likely containing counts of true vs. predicted labels, shared by author Hongmin Wang on figshare. It was last updated on June 2, 2026.
A 10-meter resolution Digital Elevation Model (DEM) for the Victorian coast, integrating new bathymetric datasets to update versions from 2017 and 2010. The dataset, produced by Veris for the Department of Energy, Environment and Climate Action, includes both high-resolution and seamless gridded outputs. It serves as a foundational product for the Victorian Coastal Mapping Program to improve understanding of dynamic coastal processes and risks.
Annual records from a program to improve food security in the Department of Casanare, Colombia, for households with children, adolescents, pregnant/lactating mothers, people with disabilities, and the elderly. The dataset is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18. It lists beneficiaries of nutritional packages, self-consumption gardens, and healthy lifestyle training.
500 omnidirectional videos with a mean duration of 18.1 seconds form the largest dataset of its kind. Fixations were collected from over 2000 observers, averaging more than 84 per video, while audio tracks were played during collection. The dataset, created by ANDRYHA and last updated in 2026, consists of high-resolution 3840x1920 streams sourced from YouTube.
Electoral districts define the municipal voting wards within the city of Trois-Rivières, Quebec. The dataset is available under a permissive CC-BY-4.0 license, facilitating reuse and redistribution. It is provided by the Government and Municipalities of Québec and is available in multiple geospatial and tabular formats.
Open data from the Government of Québec provides point locations for civic addresses within the municipalities of Saguenay and Lévis. The dataset is available in multiple geospatial and tabular formats, including SHP, GEOJSON, and CSV. Its primary use is for location-based services, urban planning, and emergency response within these specific Canadian cities.