Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
162,381 datasets
A metadata catalog listing information published by the La Bellezana Cooperative in compliance with Colombian transparency law. The dataset includes columns for information titles, responsible parties, generation dates, formats, and access methods. It is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
Procedural data for traffic and transport services managed by the Road Safety Office of Antioquia. The dataset includes the type of procedure, vehicle type, and registration date. A first version covered procedures during 2021, with a second version updated to December 31, 2024.
A publication schema from the Manizales Chamber of Commerce for Caldas outlines information subject to proactive disclosure under Colombian Law 1712 of 2014. The dataset includes 12 columns such as 'Frecuencia de Actualización', 'Nombre de Responsable de la Información', and 'Formato'. It is published via the Colombian open data platform www.datos.gov.co and was last updated on 2026-05-18.
Administrative acts performed by the city administration of Palmira during the 2025 fiscal year. The dataset includes columns for act number, status, date, issuing department, and topic. It was published on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-26.
The City of Sherbrooke inaugurated nine walls for temporary graffiti and three walls for permanent graffiti on August 13, 2012. This dataset provides the locations of these designated sites, which are identified by plaques instructing artists on usage rules. The data is provided by the Government and Municipalities of Québec under a CC-BY-4.0 license.
Records of citizen participation in Information and Communication Technology (ICT) adoption programs funded by the DigiCampus call and short courses offered by the SETIC department of Valle del Cauca. The dataset includes columns for municipality, academic classification, gender, and program details. It is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
A geospatial dataset mapping flow beds during low water periods or within retention basins in the City of Laval, Québec. The data is provided by the Government and Municipalities of Québec for informational purposes and is updated by the city to reflect its most current information. The dataset is available under a CC-BY-4.0 license and was last updated on April 22, 2026.
TasteBench is a benchmark of real, subjective taste judgements collected by breitburg. It contains aesthetic and creative decisions a real person made where there was no objectively correct answer, such as design direction, typography, curation, and visual style. Each item presents a situation and asks which option the person chose, with models scored against the actual human choice.
Christian L. L. Strauss authored a study exploring the overlap between bifactor and multilevel confirmatory factor analysis models. The 28.4 KB document, last updated on May 1, 2026, uses simulation and empirical analysis to demonstrate that bifactor solutions can emerge as artifacts of unmodeled data clustering. The work encourages researchers to consider multilevel measurement models as alternative explanations for bifactor structures.
A list of beneficiaries for the priority housing subsidy on owned land, allocated to 62 family nuclei dispersed in the urban areas of the Yopal and Aguazul municipalities in the Casanare Department. The dataset includes columns for responsible entity, municipality, beneficiary names, year, zone, allocation resolution, item, and modality. It is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
A geospatial web service containing seabed morphology and geomorphology information for the Beagle Marine Park in south-eastern Australia. The data product was published by Nanson et al. in 2023 (eCat Record 147976) and is served via an OGC Web Feature Service (WFS). It is intended to support marine park management and regulatory activities.
A dataset from datos.gov.co, last updated on 2026-05-18, containing records of preliminary investigations initiated by territorial directorates of the Ministry of Labor in Colombia. The data likely tracks the number of initial fact-finding actions taken by officials to assess the probability of labor law violations. It is provided in CSV, JSON, XML, and RDF formats.
Data from data.cityofnewyork.us details financial plan initiatives and their allocated funding amounts over a five-year period. Dollar amounts are rounded to thousands, and the dataset is updated four times per year following key financial plan publications. The data includes columns for PUBLICATION DATE, FISCAL YEAR, AMOUNT YEAR 1 through AMOUNT YEAR 5, AGENCY NAME, INITIATIVE NAME, and FUNDING.
Municipal-level vaccination percentages for children in the Risaralda department, sourced from www.datos.gov.co. The dataset includes historical data from an unspecified time range, allowing for trend analysis of vaccination campaigns. Columns include Indicador, Fuente, Municipio, Unidad de Medida, Dato Numérico, and Año.
ESQUEMA PUBLICACION INFORMACION is a structured catalog from Colombia's open data portal detailing information published by government bodies. The dataset includes columns for language, format, responsible entity, and update frequency, likely serving as a metadata index. It is hosted on the Socrata platform by datos.gov.co and was last updated on May 18, 2026.
A dataset from the Mapa Inversiones platform, last updated on 2026-05-27, detailing entities responsible for executing public investment projects in Colombia. It is provided by datos.gov.co and includes columns for project identifiers and entity names. The dataset is available in multiple structured formats including CSV, JSON, XML, and RDF.
Beagle Marine Park in south-eastern Australia contains geospatial seabed morphology and geomorphology information. The data is published as an OGC Web Map Service (WMS) intended for marine park managers, regulators, and stakeholders. It is based on the data product from Nanson et al. (2023), eCat Record 147976.
ISVIMED is the official instrument for obligated entities in Colombia to report proactively published information. The dataset contains metadata about published information, including categories, formats, responsible parties, and update frequencies. It is hosted on the Colombian open data portal, datos.gov.co, and was last updated on 2026-05-18.
Satellite-derived bathymetry data for Shark Bay in Western Australia, processed from multispectral Sentinel-2 imagery. The dataset was produced by the University of Western Australia, using acquisitions from January 2017 to December 2020. It provides a geospatial baseline for environmental monitoring and management.
A historical series on the level of health insurance coverage in the municipality of Sabaneta, considering different affiliates by health regimes and municipal population. The data is provided by the portal www.observatoriosabaneta.org and was last updated on 2026-05-18. It is available for download in multiple formats including CSV, JSON, XML, and RDF.