Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
168,494 datasets
Data from 2026-05-18, last updated on the Socrata platform, details housing subsidies delivered by the Caja Promotora de Vivienda Militar y de Policía under the 'Vivienda 14' model for the purchase or improvement of used homes. The dataset is broken down by department, housing price range, and military or police force. It is provided by the Colombian open data portal www.datos.gov.co.
Colombian government data describing the publication schema for proactive information disclosure by public entities, as mandated by Law 1712 of 2014. The dataset includes metadata fields such as responsible parties, formats, and update frequencies. It is published by www.datos.gov.co and was last updated on 2026-05-18.
Flood signs in the city of York, United Kingdom. The dataset is a live API link to the City of York Council's GIS server, meaning updates to the master data are reflected immediately. It is published by the Government Digital Service under the OGL-UK-3.0 license.
A live-updated geospatial dataset of Gypsy and Traveller sites within the City of York. The data is published by the Government Digital Service and sourced from the City of York Council's GIS server, with changes to the master copy reflected immediately. The dataset is available under the OGL-UK-3.0 license.
York, UK, contains a dataset of General Practitioner (GP) surgery locations. The data is provided as a live API link to the City of York Council's GIS server, meaning updates to the master copy are reflected immediately. It is published by the Government Digital Service under the OGL-UK-3.0 license.
Colombian government web publication metadata compiled for compliance with Law 1712 of 2014 on transparency and access to public information. The dataset includes columns for information responsible parties, publication status, language, and update frequency. It is hosted by www.datos.gov.co and was last updated on 2026-05-18.
TACO (Text-to-SQL with Ambiguous and Cross-database Open-domain queries) is a benchmark for evaluating Text-to-SQL systems on real-world data-lake scenarios. It was created by Akanezora and was accepted for publication at VLDB 2026. The benchmark focuses on challenges absent in prior benchmarks like Spider or BIRD.
Colombian project profiles registered in 2020, sourced from datos.gov.co. The dataset includes columns for project objectives, results, productive lines, and detailed beneficiary demographics such as youth, victims, and reincorporated individuals. The data was last updated on 2026-05-18.
A 2026 data release for ABForge, a post-training pipeline for paper-grounded ablation design. The dataset provides Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL) training pools and the held-out AblationBench evaluation sets. It was authored by SlowGuess and hosted on Hugging Face.
Australian Ocean Data Network hosts a legacy dataset titled 'Cretaceous marine macrofossils from the Great Artesian Basin in Queensland'. The dataset is published on data_gov_au and was last updated on 2026-06-17. No abstract or detailed metadata is available.
Legacy product from the Australian Ocean Data Network with no abstract available. The dataset describes a specialized drill designed for use on coral reef environments. It was last updated on 2026-06-17.
Fillipe Pedroso-Santos compiled a dataset on Amazonian anurans (frogs and toads) for research and conservation. The collection includes scripts, figures, appendix, and shapefiles, totaling 4.8 MB. It was last updated on May 22, 2026.
BMR proposals for drill sites for legs XXVIII and XXIX of the Deep Sea Drilling Project. The dataset is published by the Australian Ocean Data Network on data_gov_au. It was last updated on 2026-06-17.
Methods of sample preparation and analysis used in the Broad Sound estuary study project is a legacy dataset from the Australian Ocean Data Network. The dataset likely contains methodological documentation for an environmental monitoring project. Its last recorded update was 2026-06-17.
Contractual processes carried out by the municipality of Palmira during the 2022 fiscal year. The dataset includes columns such as OBJETO DEL CONTRATO, NUMERO DE PROCESO, VALOR CERTIFICADO DISPONIBLIDAD PUBLICA (CDP), and MODALIDAD DE SELECCIÓN. It is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
Approved venues for marriage in York, sourced from the City of York Council's GIS server. The dataset is published by the Government Digital Service under the OGL-UK-3.0 license and is available in multiple formats including GeoJSON, KML, and CSV. The data is a live API link, meaning changes to the master copy are reflected immediately.
Informal Spaces in York provides a live GIS feed of parks, gardens, and green spaces managed by the City of York Council. The dataset is published by the Government Digital Service under the OGL-UK-3.0 license and is available in multiple geospatial formats. Changes to the master data are reflected immediately in the dataset resources.
A live-linked dataset of children's open spaces and play areas in York, UK, maintained by the City of York Council. The data is published by the Government Digital Service under an OGL-UK-3.0 license and is available in multiple geospatial formats. The dataset is connected to a live API, meaning updates to the master copy are reflected immediately.
Children's centres in York, UK, provided as a live API link to the City of York Council's GIS server. The dataset is published by the Government Digital Service under the OGL-UK-3.0 license and is available in GEOJSON, KML, and CSV formats. Updates to the master data are reflected immediately in this dataset.
York City Walls' Bars are represented as geospatial data points. The dataset is provided by the City of York Council and published via the Government Digital Service under the OGL-UK-3.0 license. The data is served as a live API link to the council's GIS server, meaning updates to the master copy are reflected immediately.