Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
149,839 datasets
Statistical information on individuals who have competed for diplomatic and consular careers. The dataset includes age group, region, profession, and other details for applicants. It is published by datos.gov.co and was last updated on 2026-05-18. The data is available in CSV, JSON, XML, and RDF formats.
Causeway Coast and Glens Borough Council provides a dataset listing the locations of all recycling facilities within its administrative area. The dataset is published by the Government Digital Service under an Open Government Licence. The last update date is unknown.
A 4.4 KB dataset compiled by Robert Marcelo Sevilla, last updated on 2026-06-01. It contains language and phyla counts for 28 mainland Chinese provinces and 4 regions within Hunan, sourced from Ethnologue. Calculations for language density, genetic density, and proportion of the world's total are included.
Monthly traffic enforcement data from Edmonton's Intersection Safety Devices includes tickets issued for specific speed violations and red light infractions. The dataset, hosted by data.edmonton.ca, provides geospatial coordinates and administrative boundaries for each device location. Columns such as '1-5 Over Speed Limit' and 'Total Red Light Tickets' allow for granular analysis of traffic violations by location and time.
A daily-updated listing of all active and closed Montgomery County solicitations issued in the past 7 years. The dataset is published by data.montgomerycountymd.gov and includes details on solicitation type, status, dates, and issuing departments. It was last updated on 2026-05-29.
Eye-tracking research analyzes the effectiveness of visual wine tourism advertising on digital travel platforms. The 701.5 KB dataset, authored by JJ C-H and last updated in May 2026, likely contains metrics on visual attention, processing, and ad recall. It studies images featuring human presence, wineries, and vineyards on platforms like Airbnb, Booking.com, and Tripadvisor.
Zaragoza, Spain is the focus of this dataset containing location points and descriptions used to calculate distances and deviations. The analysis compares spatial variations between Anton van den Wyngaerde's 1563 chorography of the city and current maps. The dataset was created by Gabriel Marro-Gros and is 2.1 MB in size.
Mengyao Su's dataset provides 11.5 GB of 4D-STEM experimental and simulated data for evaluating and correcting projector lens aberrations in electron ptychography. The data was last updated on June 3, 2026, and is shared under a CC-BY-4.0 license. File formats include DB, TXT, NPY, PNG, CSV, and PT.
A clinical study investigating gut microbiome and metabolic changes in people living with HIV (PLWH) who have metabolic dysfunction-associated fatty liver disease (MAFLD). The dataset is hosted on figshare, authored by HuiTing Liu, and was last updated on May 30, 2026. The global prevalence of MAFLD in PLWH is estimated at 34%.
2.0 MB of location points and schematic context used to calculate distances and deviations between a historical chorography of Zaragoza and current maps. The dataset, created by Gabriel Marro-Gros and last updated on 2026-05-24, supports the analysis of spatial variations in the city's depiction. It includes files in TIFF, CSV, and TXT formats and is shared under a CC-BY-4.0 license.
Weiming Yu's research dataset quantifies the relationship between the intelligent economy and Green Total Factor Productivity (GTFP) across Chinese cities from 2014 to 2024. The 5.9 MB XLSX file contains a comprehensive panel dataset analyzed using bidirectional fixed effects, moderation analysis, and panel threshold modeling. It was last updated on May 9, 2026, and is shared under a CC-BY-4.0 license.
Distritos Navales y Dispensarios de Sanidad – Armada Nacional is a dataset from the Colombian National Navy listing its naval districts and health dispensaries. The data includes location and contact details for units across the country, sourced from the national open data portal datos.gov.co. The dataset was last updated on 2026-05-18.
Records from 311 complaints received by the New York City Department of Health and Mental Hygiene (DOHMH). Each record represents a single complaint about indoor environmental conditions, such as air quality, pests, or mold. The data can be used to identify complaint patterns but does not confirm the presence of violations.
Upper Shaunavon Isopach is a geospatial dataset published by the Government of Saskatchewan on the open_canada platform. It likely contains thickness measurements or contours for a geological formation. The dataset was last updated on 2026-06-03.
Shaunavon Zero Edge is a geospatial dataset published by the Government of Saskatchewan. It is available in multiple formats including KML, SHP, and GeoJSON. The dataset was last updated on June 3, 2026.
From 25 February 2015 to 04 September 2025, this dataset contains sea temperature measurements collected by loggers deployed around Mackay Reef. It provides a long-term record for monitoring marine environmental conditions. The data is hosted by the Australian Ocean Data Network.
Data.calgary.ca provides boundaries for the 2022 Alcohol in the Park Project. The dataset includes columns such as LOCATION, REGION_TYPE, and MULTIPOLYGON. It was last updated on 2026-05-28 23:38:25.
Wenting Cao developed an automated framework for annual mangrove mapping across all Pacific Island Countries from 2013 to 2024 using Landsat-8 imagery. The dataset reveals a net mangrove increase of 53,153 hectares (9.0%) with substantial local turnover, driven by landward migration and losses from cyclones and development. The framework achieved high classification accuracy (Overall Accuracy 94.5–96.0%) and is designed for cloud-prone, data-scarce regions.
A 30-year time series of sea water temperature measurements collected by loggers deployed around Hayman Island in the Great Barrier Reef, from 29 May 1996 to 22 Apr 2026. The data was collected by the Australian Ocean Data Network and last updated on the platform in June 2026.
Global Monthly EASE-Grid Snow Water Equivalent Climatology provides a 29-year record of satellite-derived snow mass data from November 1978 to May 2007. The dataset is produced by NASA, gridded at 25 km resolution on Northern and Southern Hemisphere Equal-Area Scalable Earth Grids. It combines measurements from SMMR and SSM/I sensors, with Northern Hemisphere data enhanced by snow cover frequency information.