DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Machine Learning Datasets | DataSalon

All Categories

🤖

Machine Learning

General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites

192,179 datasets

Machine Learning

WFD Cycle 2: Chemical Water Body Classification for the United Kingdom

Chemical status classifications for water bodies in the United Kingdom, assessed under the Water Framework Directive Cycle 2. The dataset records compliance with environmental standards for priority substances, with each water body classified as 'good' or 'fail' based on a one-out-all-out approach. The data was produced by the Environment Agency and is provided in XLSX format.

Tabular🇬🇧 United KingdomExcelEnvironmental monitoringChemical StatusWater Quality+1

0 views

Machine Learning

National Plan for Reliable Tuberculosis Laboratory Services Using a Systems Approach: Reco

A report from the Centers for Disease Control and Prevention and the Association of Public Health Laboratories Task Force presents a framework to improve tuberculosis (TB) laboratory services in the United States. It describes specific actions and performance measures to guide the development of an integrated system for TB testing and information flow. The author is Thomas M. Shinnick of the CDC.

TextLaboratory ServicesHealth PolicyCdcHealthcareTuberculosisPublic Health+1

0 views

Machine Learning

Satellite-Detected Damage Assessment for San Julian, Philippines, December 2014

UNOSAT identified 279 damaged structures in San Julian Area, Eastern Samar Province, Philippines using satellite imagery. The analysis, dated December 2014, categorizes 66 structures as destroyed, 148 as severely damaged, and 65 as moderately damaged. This is a preliminary, unvalidated assessment provided by the United Nations Satellite Centre.

GeospatialPhilippinesSatellite ImageryComputer VisionDamage AssessmentDisaster Response+1

0 views

Machine Learning

Senedd Cymru Constituency Boundaries for Wales, May 2026

May 2026 digital vector boundaries for Senedd Cymru constituencies in Wales. The dataset is provided by the Office for National Statistics and contains full-resolution boundaries clipped to the coastline.

GeospatialZIPCSVTextExcelPolitical BoundariesGovernmentWales+1

0 views

Machine Learning

NESP MaC 4.12: Aerial Survey of Mangrove Dieback in Northern Australia, 2024-2026

Northern Australia's Gulf of Carpentaria coastline was surveyed following mass mangrove dieback events in 1982 and 2015. The project, commissioned by the NESP Marine and Coastal Hub, produced aerial and satellite imagery, environmental surveys, and a technical report. Data outputs are managed by the Australian Ocean Data Network, with metadata last updated in July 2026.

ImageTabularGeospatialMangrove DiebackAerial SurveyEnvironmental monitoringCoastal ecosystems+1

0 views

Machine Learning

QAGOMA Annual Consultancy Expenditure Data for Recent Fiscal Years

Annual consultancy expenditure data for the Queensland Art Gallery and Gallery of Modern Art (QAGOMA). The dataset reports nil expenditure for the 2022-23, 2023-24, and 2024-25 financial years. It is published by QAGOMA governance under a CC-BY-4.0 license and was last updated in July 2026.

TabularCSVGovernment SpendingArts CultureConsultancy ExpenditureAnnual Report+1

0 views

Machine Learning

RV Investigator IN2025_V06: Echosounder Data from the Coral Sea Frontier

October 10 to November 14, 2025, acoustic backscatter data was collected on the RV Investigator voyage IN2025_V06, The Coral Sea Frontier. The dataset comprises 4,528 files totalling 109 GB of raw data from Simrad EK80 split-beam echosounders at 18, 38, 70, 120, 200, and 333 kHz, stored by CSIRO. Additional data products may be available on request.

AudioTime SeriesGeospatialOceanographyAcoustic BackscatterMarine ScienceSplit Beam Echosounder+1

0 views

Machine Learning

IN2025_V06: Gravity Data from the Coral Sea Frontier Voyage

74 raw data files totalling 5785 MB of gravity measurements collected aboard the RV Investigator voyage IN2025_V06. The Coral Sea Frontier voyage departed Brisbane on 10 October 2025 and returned on 14 November 2025, using a MicroG Lacoste Air-Sea II gravity meter. Data are stored by CSIRO, with additional information in a GSM data acquisition and processing report.

Time SeriesGeospatialCoral SeaMarine ScienceGeophysicsOceanographic SurveyGravity Measurement+1

0 views

Machine Learning

Northern Ireland Office Workforce: Monthly Staff Numbers and Costs

Northern Ireland Office (NIO) publishes monthly workforce management information. The data includes staff numbers and payroll costs, split between payroll and non-payroll (contingent labour) staff, with figures provided as full-time equivalents (FTE) and headcount mapped to Civil Service grades. The published data is validated, released in CSV format, and is OGL-licensed for reuse.

TabularTime SeriesCSVWorkforce ManagementPublic SectorNorthern Ireland OfficeStaff Numbers+1

0 views

Machine Learning

SUPERSEDED 2006 - Ipswich Planning Scheme: Schedule 3 Identified Places of Interest

Schedule 3 of the superseded 2006 Ipswich Planning Scheme contains a list of identified places of interest. The dataset is provided by Ipswich City Council and was last updated in July 2026. It is available in multiple geospatial formats including SHP, GEOJSON, and KML.

GeospatialPlaces Of InterestSuperseded DataUrban PlanningLocal Government+1

0 views

Machine Learning

UK Travel to Work Areas Map for 2001 Census

A 2 MB PDF map from the Office for National Statistics shows travel to work areas in the United Kingdom as of December 2001. The map likely contains geographic boundaries defining labor market areas. It is licensed under OGL-UK-3.0.

GeospatialUnited KingdomTravel To WorkCensus 2001+1

0 views

Machine Learning

UK Travel to Work Areas Map from December 2011

A PDF map visualizes the official Travel to Work Areas (TTWAs) for the United Kingdom as defined in December 2011. The map file is 3 MB in size and was produced by the Office for National Statistics. This geographic boundary data was last updated in the platform's metadata on July 8, 2026.

GeospatialGeospatial BoundariesTravel To Work AreasUnited KingdomCensus Geography+1

0 views

Machine Learning

Middle Layer Super Output Areas Map for East Midlands Region, December 2011

Office for National Statistics provides a 33 MB PDF map detailing the Middle Layer Super Output Areas (MSOAs) for England's East Midlands region. This geospatial data snapshot captures the administrative boundaries as they were defined in December 2011. The map likely contains polygon boundaries for statistical analysis and regional planning.

GeospatialRegional MapsAdministrative BoundariesCensus GeographyUk Geography+1

0 views

Machine Learning

Middle Layer Super Output Areas Map for Wales, December 2011

December 2011 boundaries for Middle Layer Super Output Areas (MSOAs) in Wales, provided as a PDF map by the Office for National Statistics. The map file size is 16 MB. The dataset was last updated on the platform in July 2026.

GeospatialAdministrative BoundariesCensus GeographyUk GeographyWales+1

0 views

Machine Learning

UK Census Output Areas Map for South West England, December 2011

A PDF map illustrates the boundaries of Output Areas within the South West Region of England as defined in December 2011. The map was produced by the Office for National Statistics and is a 15 MB file. The data snapshot reflects the administrative geography used for the 2011 UK Census.

GeospatialGeospatial BoundariesSouth West EnglandUk CensusAdministrative Areas+1

0 views

Machine Learning

Strawberry Fruit Shape: Images and Morphometric Features

6,874 binary images of individual strawberry fruits, each standardized to 1000x1000 pixels, are paired with a CSV file containing 431 morphometric features. The dataset captures fruit from two distinct harvest dates, enabling temporal shape analysis. Features were extracted using methods including Elliptical Fourier Analysis, Generalized Procrustes Analysis, and EigenFruit Analysis.

ImageTabularStrawberry Fruit ShapePlant BiologyImage AnalysisComputer VisionMorphometricsHorticulture+1

0 views

Machine Learning

BrainIAK Tutorials: Condensed Neuroimaging Datasets

BrainIAK tutorials provide a collection of pre-processed neuroimaging datasets, condensed from original studies to reduce file size. The collection includes datasets such as VDC, Ninety Six, Face-scene, Latatt, Pieman2, Raider, and Sherlock_processed, paired with specific tutorials for practical application. These datasets are ready for use in teaching and demonstrating methods for functional magnetic resonance imaging (fMRI) data analysis.

MultimodalBrainiakCognitive ScienceFmriNeuroscienceTutorialSynthetic+1

0 views

Machine Learning

Sensitivity of Detection for Fugitive Methane Emissions from Coal Seam Gas Fields

A study analyzing the sensitivity of atmospheric monitoring techniques for detecting fugitive methane emissions from coal seam gas (CSG) fields. The work is based on three years of continuous methane and carbon dioxide measurements from the 'Arcturus' monitoring station in Australia's Bowen Basin. Researchers used a coupled meteorological and air pollution model to simulate emissions and statistically compare perturbed signals against a baseline.

TabularTime SeriesEnvironmental scienceBenchmarkCoal Seam GasGreenhouse GasMethane EmissionsAtmospheric MonitoringSynthetic+1

0 views

Machine Learning

Opticomm Pty Ltd Anticipatory Notices for FTTP Network Projects in Australia

Australian Communications and Media Authority published anticipatory notices for seven fiber-to-the-premises (FTTP) network project areas by Opticomm Pty Ltd. The data includes project locations, estimated completion dates, and contract details, with the latest update on 2 July 2026. Records contain formatted addresses and latitude-longitude coordinates for each project area.

TabularGeospatial🇦🇺 AustraliaZIPExcelTelecommunicationsInfrastructure PlanningFttp+1

0 views

Machine Learning

Wet Tropics Mountain Locations: 344 Points from 1:100,000 Scale Maps

344 point features depict mountains, peaks, ranges, and hills within the Wet Tropics Bioregion. Data originated from 1:250,000 scale topographic sources and was corrected for accuracy against 1:100,000 scale maps, with 90% of points within +/- 50 metres of true position. The dataset is provided by the Wet Tropics Management Authority via the Australian Ocean Data Network.

Geospatial🇦🇺 AustraliaMountainsTopographyWet TropicsGeography+1

0 views

PreviousPage 14 of 9585Next