General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
160,842 datasets
The Plan Anual de Adquisiciones 2022 aims to increase competition in state procurement processes. It includes columns for estimated contract value, selection modality, UNSPSC codes, and contact details for responsible officials. The dataset is hosted by www.datos.gov.co and was last updated on 2026-05-18.
Orthophotographs of the Chaudière-Appalaches administrative region in Québec, captured in spring 2020. The dataset is provided by the Government and Municipalities of Québec under a CC-BY-4.0 license. Available file formats include SHP, CSV, HTML, and OTHER, suggesting a mix of imagery and associated metadata.
A list of existing framework agreements for professional internships between Unidades Tecnológicas de Santander and various companies across Colombia. The dataset includes academic programs, partner companies, and agreement start and expiration dates. It was last updated on 2026-05-18 and is hosted by the Colombian open data portal www.datos.gov.co.
Orthophoto_2021 is a high-resolution aerial imagery dataset of the city of Rimouski, Québec, provided by the Government and Municipalities of Québec. The dataset is cut into 1km x 1km tiles with a resolution of 4 centimeters per pixel. It was last updated on the platform in April 2026 and is available under a CC-BY-4.0 license.
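As a rough sanity check on the figures above, the pixel dimensions of one tile can be worked out from the stated tile size and resolution. This sketch assumes each tile is exactly 1 km on a side and that pixels are square; neither is confirmed by the listing, and the variable names are illustrative only.

```python
# Assumptions (not confirmed by the listing): tiles are exactly 1 km x 1 km
# and pixels are square at the stated 4 cm ground resolution.
TILE_SIDE_CM = 1_000 * 100  # 1 km expressed in centimetres
PIXEL_SIZE_CM = 4           # ground resolution from the description

pixels_per_side = TILE_SIDE_CM // PIXEL_SIZE_CM
pixels_per_tile = pixels_per_side ** 2

print(f"{pixels_per_side:,} px per side, {pixels_per_tile:,} px per tile")
```

Under these assumptions a single tile is a very large raster, which is worth keeping in mind before loading tiles whole into memory.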
Monthly counts of active migrant health insurance affiliates in Colombia's Norte de Santander department, specifically those holding a Special Permit of Permanence (PEP). The data is organized by municipality and month, with columns for each month of the year and municipality identifiers. It originates from the Colombian open data portal, datos.gov.co, and was last updated in May 2026.
EPM shares its active information assets register as open data, a requirement under Colombia's Transparency and Access to Information Law 1712 of 2014. The dataset is published via the national open data portal www.datos.gov.co and was last updated in May 2026. Columns suggest a catalog of information resources, including their responsible unit, format, language, and status.
Monitoring station measurements of physical-chemical parameters across the upper, middle, and lower basin of the Atrato River. Data includes station names, coordinates, measurement values, units, and precise timestamps. The dataset is hosted by www.datos.gov.co and was last updated on 2026-05-18.
A 2026 dataset from figshare by Lida Xing, licensed under CC-BY-4.0. It documents the distribution of tetrapod tracks, including dinosaurs and pterosaurs, at the Bajiedong tracksite in Southwest China. The dataset is stored as a RAR archive; its size and row count are not provided.
Weekly counts of laboratory-confirmed influenza cases are reported by age group and county. The data is published by health.data.ny.gov and was last updated on May 22, 2026. Columns suggest temporal and geographic dimensions for tracking flu activity.
From the second week of August 2020, this dataset logs water service interruptions managed by Aguas Nacionales EPM S.A. E.S.P. It includes the reason, duration, and precise location of affected sectors. The data is provided by www.datos.gov.co and was last updated on 2026-05-18.
A dataset from datos.gov.co, last updated on 2026-05-18, providing statistics on higher education in Colombia. It likely contains annual enrollment figures broken down by institution, academic program, and student demographics. The data is sourced from the Socrata platform and is available in CSV, JSON, XML, and RDF formats.
Numerator and denominator data for calculating neonatal mortality rates across municipalities in the Bolívar Department of Colombia from 2010 to 2024. The dataset is sourced from the Colombian National Administrative Department of Statistics (DANE) and includes columns for yearly numerators (deaths of live-born infants within the first 28 days of life), denominators (live births), and calculated results. It was last updated on the datos.gov.co platform in May 2026.
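The relationship between the numerator, denominator, and result columns can be sketched as follows. The listing does not state the scaling factor; this sketch assumes the conventional expression of neonatal mortality per 1,000 live births. The function name and the sample figures are hypothetical and are not taken from the dataset.

```python
from typing import Optional


def neonatal_mortality_rate(
    deaths: int, live_births: int, per: int = 1000
) -> Optional[float]:
    """Return neonatal deaths per `per` live births.

    Assumption: the rate is expressed per 1,000 live births, which is the
    usual convention; the dataset's own scaling is not stated in the listing.
    Returns None when there are no live births, since the rate is undefined.
    """
    if live_births <= 0:
        return None
    return deaths * per / live_births


# Hypothetical figures for one municipality-year, for illustration only.
print(neonatal_mortality_rate(deaths=12, live_births=1500))
```

Returning `None` for a zero denominator keeps small municipalities with no recorded births from producing a division error or a misleading rate of zero.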
Ground-based Leaf Area Index (LAI) data from ENON and Nakakawane, paired with corresponding cloud-free Sentinel-2 satellite imagery. The dataset was authored by Xuanwen Wang and last updated on June 4, 2026. It is a small dataset of 806.9 KB, stored in CSV format.
Scottish Water's Sewer Catchment Areas, also known as Drainage Operational Areas (DOAs), define the geographic zones where wastewater assets and surface water flow to a single Sewage Treatment Works or Public Septic Tank. The dataset is provided by the Scottish Government via SpatialData.gov.scot and was last updated on 2026-06-04.
Disbursement details for agricultural development credit from 2020 onward, hosted on Colombia's open data portal. The data includes loan values, interest rates, beneficiary characteristics, and financial inclusion indicators for producers accessing credit under FINAGRO conditions. Columns such as `tipo_productor`, `valor_credito`, and `sexo` provide insights into the recipients and terms of these loans.
96 information assets are cataloged in this metadata schema for the proactive disclosure of information by the Mayor's Office of Bucaramanga. The dataset describes planned and published information resources from 2021 to 2025, with columns for responsible parties, formats, and update frequency. It is published on the datos.gov.co platform via Socrata and was last updated on 2026-05-18.
Macro routes for street sweeping and cleaning services operated by EMVARIAS Grupo EPM. The dataset includes schedules for the corregimientos of San Cristóbal, Santa Elena, and San Antonio de Prado, with service frequencies that include Monday through Saturday, every other day, and twice per week. It is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
LeRobot created this dataset for robot learning. It contains 800 episodes and 20,000 frames of data, recorded at 15 frames per second. The dataset was last updated on June 8, 2026.
800 robot manipulation episodes generated using the LeRobot framework. The dataset contains 20,000 frames recorded at 15 frames per second, focusing on a single task. It was created by lerobot and last updated in June 2026.
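The stated episode count, frame count, and frame rate imply a total recording time and an average episode length. The short calculation below derives both, assuming that all frames were recorded at a constant 15 frames per second and that the totals in the description are exact; the variable names are illustrative only.

```python
# Figures taken from the description; a constant frame rate is assumed.
EPISODES = 800
FRAMES = 20_000
FPS = 15

frames_per_episode = FRAMES / EPISODES          # average, not per-episode truth
total_seconds = FRAMES / FPS                    # total recorded time
seconds_per_episode = frames_per_episode / FPS  # average episode duration

print(f"{frames_per_episode:.0f} frames per episode on average")
print(f"{total_seconds / 60:.1f} minutes of data in total")
print(f"{seconds_per_episode:.2f} seconds per episode on average")
```

These are averages only; individual episodes may be longer or shorter than the mean.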
60 episodes of robot manipulation data created using LeRobot, totaling 38,861 frames. The dataset is structured for training and contains video and structured data files for a single task performed by a Franka robot. It was last updated on June 15, 2026.