Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
167,055 datasets
Water consumption records from the Municipal Water, Sewerage, and Sanitation Company of Funza (EMAAF ESP), classified by month, usage type, and socioeconomic stratum. The dataset is available in multiple formats including CSV, JSON, XML, and RDF. It was last updated on May 18, 2026, and is hosted on the Colombian open data portal www.datos.gov.co.
A flood study report for the Middle Harbour Northern Catchments, authored by Ku-Ring-Gai Council and last updated on 2026-05-28. The study area includes Middle Harbour Creek and its tributaries in St Ives, and Rocky Creek, Stoney Creek, and High Ridge Creek in Gordon/East Killara, which flow into Middle Harbour.
800 episodes of robot manipulation data created using the LeRobot framework. The dataset contains 20,000 frames recorded at 15 frames per second. It is structured for training imitation learning models on a single push task.
Xarm Lift Medium Replay Image is a dataset of 800 episodes and 20,000 frames created using the LeRobot framework. The dataset likely contains image observations and teleoperation data for a robotic lifting task. It was last updated on June 8, 2026.
Natural disasters in Colombia, sourced from OCHA and UNGRD. The dataset is provided in XLSX format and was last updated on May 26, 2026. The license is CC-BY-4.0.
Sectors and days for collecting waste, recyclable materials, and compostable materials in Québec. The dataset is provided by the Government and Municipalities of Québec under a CC-BY-4.0 license and was last updated on 2026-04-22. It is available in multiple formats including KML, GeoJSON, CSV, and ESRI REST.
A metadata registry from the Departmental Comptroller of Nariño, detailing its information publication schemes as mandated by Colombian Transparency Law 1712 of 2014. The dataset includes columns for information category, responsible entity, publication frequency, and format. It is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
Administrative records from the Secretariat of Valorization and Capital Gains of San José de Cúcuta, Colombia, detailing individual property tax contributions. The data likely contains property identifiers, resolution numbers, and application dates for updating the municipal cadastral base. The dataset was last updated on 2026-05-18 and is available via the datos.gov.co platform.
A metadata catalog listing information published by the La Bellezana Cooperative in compliance with Colombian transparency law. The dataset includes columns for information titles, responsible parties, generation dates, formats, and access methods. It is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
Procedural data for traffic and transport services managed by the Road Safety Office of Antioquia. The dataset includes the type of procedure, vehicle type, and registration date. A first version covered procedures during 2021, with a second version updated to December 31, 2024.
A publication schema from the Manizales Chamber of Commerce for Caldas outlines information subject to proactive disclosure under Colombian Law 1712 of 2014. The dataset includes 12 columns such as 'Frecuencia de Actualización', 'Nombre de Responsable de la Información', and 'Formato'. It is published via the Colombian open data platform www.datos.gov.co and was last updated on 2026-05-18.
Administrative acts performed by the city administration of Palmira during the 2025 fiscal year. The dataset includes columns for act number, status, date, issuing department, and topic. It was published on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-26.
The City of Sherbrooke inaugurated nine walls for temporary graffiti and three walls for permanent graffiti on August 13, 2012. This dataset provides the locations of these designated sites, which are identified by plaques instructing artists on usage rules. The data is provided by the Government and Municipalities of Québec under a CC-BY-4.0 license.
Records of citizen participation in Information and Communication Technology (ICT) adoption programs funded by the DigiCampus call and short courses offered by the SETIC department of Valle del Cauca. The dataset includes columns for municipality, academic classification, gender, and program details. It is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
A geospatial dataset mapping flow beds during low water periods or within retention basins in the City of Laval, Québec. The data is provided by the Government and Municipalities of Québec for informational purposes and is updated by the city to reflect its most current information. The dataset is available under a CC-BY-4.0 license and was last updated on April 22, 2026.
TasteBench is a benchmark of real, subjective taste judgements collected by breitburg. It contains aesthetic and creative decisions a real person made where there was no objectively correct answer, such as design direction, typography, curation, and visual style. Each item presents a situation and asks which option the person chose, with models scored against the actual human choice.
Christian L. L. Strauss authored a study exploring the overlap between bifactor and multilevel confirmatory factor analysis models. The 28.4 KB document, last updated on May 1, 2026, uses simulation and empirical analysis to demonstrate that bifactor solutions can emerge as artifacts of unmodeled data clustering. The work encourages researchers to consider multilevel measurement models as alternative explanations for bifactor structures.
A list of beneficiaries for the priority housing subsidy on owned land, allocated to 62 family nuclei dispersed in the urban areas of the Yopal and Aguazul municipalities in the Casanare Department. The dataset includes columns for responsible entity, municipality, beneficiary names, year, zone, allocation resolution, item, and modality. It is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
A geospatial web service containing seabed morphology and geomorphology information for the Beagle Marine Park in south-eastern Australia. The data product was published by Nanson et al. in 2023 (eCat Record 147976) and is served via an OGC Web Feature Service (WFS). It is intended to support marine park management and regulatory activities.
A dataset from datos.gov.co, last updated on 2026-05-18, containing records of preliminary investigations initiated by territorial directorates of the Ministry of Labor in Colombia. The data likely tracks the number of initial fact-finding actions taken by officials to assess the probability of labor law violations. It is provided in CSV, JSON, XML, and RDF formats.