General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
168,495 datasets
A dataset of General Practitioner (GP) surgery locations in York, UK. The data is provided as a live API link to the City of York Council's GIS server, meaning updates to the master copy are reflected immediately. It is published by the Government Digital Service under the OGL-UK-3.0 license.
Colombian government web publication metadata compiled for compliance with Law 1712 of 2014 on transparency and access to public information. The dataset includes columns for the parties responsible for the information, publication status, language, and update frequency. It is hosted by www.datos.gov.co and was last updated on 2026-05-18.
TACO (Text-to-SQL with Ambiguous and Cross-database Open-domain queries) is a benchmark for evaluating Text-to-SQL systems on real-world data-lake scenarios. It was created by Akanezora and was accepted for publication at VLDB 2026. The benchmark focuses on challenges absent in prior benchmarks like Spider or BIRD.
Colombian project profiles registered in 2020, sourced from datos.gov.co. The dataset includes columns for project objectives, results, productive lines, and detailed beneficiary demographics such as youth, victims, and reincorporated individuals. The data was last updated on 2026-05-18.
A 2026 data release for ABForge, a post-training pipeline for paper-grounded ablation design. The dataset provides Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL) training pools and the held-out AblationBench evaluation sets. It was authored by SlowGuess and hosted on Hugging Face.
The Australian Ocean Data Network hosts a legacy dataset titled 'Cretaceous marine macrofossils from the Great Artesian Basin in Queensland'. The dataset is published on data.gov.au and was last updated on 2026-06-17. No abstract or detailed metadata is available.
Legacy product from the Australian Ocean Data Network with no abstract available. The dataset describes a specialized drill designed for use in coral reef environments. It was last updated on 2026-06-17.
Fillipe Pedroso-Santos compiled a dataset on Amazonian anurans (frogs and toads) for research and conservation. The collection includes scripts, figures, an appendix, and shapefiles, totaling 4.8 MB. It was last updated on May 22, 2026.
BMR proposals for drill sites for legs XXVIII and XXIX of the Deep Sea Drilling Project. The dataset is published by the Australian Ocean Data Network on data.gov.au. It was last updated on 2026-06-17.
'Methods of sample preparation and analysis used in the Broad Sound estuary study project' is a legacy dataset from the Australian Ocean Data Network. The dataset likely contains methodological documentation for an environmental monitoring project. Its last recorded update was 2026-06-17.
Contractual processes carried out by the municipality of Palmira during the 2022 fiscal year. The dataset includes columns such as OBJETO DEL CONTRATO, NUMERO DE PROCESO, VALOR CERTIFICADO DISPONIBLIDAD PUBLICA (CDP), and MODALIDAD DE SELECCIÓN. It is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
Approved venues for marriage in York, sourced from the City of York Council's GIS server. The dataset is published by the Government Digital Service under the OGL-UK-3.0 license and is available in multiple formats including GeoJSON, KML, and CSV. The data is a live API link, meaning changes to the master copy are reflected immediately.
Informal Spaces in York provides a live GIS feed of parks, gardens, and green spaces managed by the City of York Council. The dataset is published by the Government Digital Service under the OGL-UK-3.0 license and is available in multiple geospatial formats. Changes to the master data are reflected immediately in the dataset resources.
A live-linked dataset of children's open spaces and play areas in York, UK, maintained by the City of York Council. The data is published by the Government Digital Service under an OGL-UK-3.0 license and is available in multiple geospatial formats. The dataset is connected to a live API, meaning updates to the master copy are reflected immediately.
Children's centres in York, UK, provided as a live API link to the City of York Council's GIS server. The dataset is published by the Government Digital Service under the OGL-UK-3.0 license and is available in GeoJSON, KML, and CSV formats. Updates to the master data are reflected immediately in this dataset.
The Bars of the York City Walls, represented as geospatial data points. The dataset is provided by the City of York Council and published via the Government Digital Service under the OGL-UK-3.0 license. The data is served as a live API link to the council's GIS server, meaning updates to the master copy are reflected immediately.
Locations of talking signs in York, a form of accessibility infrastructure. The dataset is a live API link to the City of York Council's GIS server, ensuring updates to the master copy are reflected immediately. It is published by the Government Digital Service under the OGL-UK-3.0 license.
Student enrollment records from the Escuela Superior de Artes Débora Arango, covering the period from 2019 to 2025. The data includes demographic details such as sex, birth year, and place of birth, as well as academic information such as program of study and enrollment periods. It is hosted on the Colombian open data portal www.datos.gov.co and was last updated in May 2026.
Legacy product from the Australian Ocean Data Network, last updated on 2026-06-17. The dataset likely contains information about the seafloor and geological features of the Huon Gulf region near New Guinea. Metadata is minimal, with no abstract or column details available, and the primary file formats are HTML and PDF.
A reconnaissance dataset for Ashmore Reef, published by the Australian Ocean Data Network on data.gov.au. The dataset's last update was recorded as 2026-06-17. The raw description indicates it is a legacy product with no abstract available.