General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
161,680 datasets
Natural disasters in Colombia, sourced from OCHA and UNGRD. The dataset is provided in XLSX format and was last updated on May 26, 2026. The license is CC-BY-4.0.
Sectors and days for collecting waste, recyclable materials, and compostable materials in Québec. The dataset is provided by the Government and Municipalities of Québec under a CC-BY-4.0 license and was last updated on 2026-04-22. It is available in multiple formats including KML, GeoJSON, CSV, and ESRI REST.
A metadata registry from the Departmental Comptroller of Nariño, detailing its information publication schemes as mandated by Colombian Transparency Law 1712 of 2014. The dataset includes columns for information category, responsible entity, publication frequency, and format. It is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
Administrative records from the Secretariat of Valorization and Capital Gains of San José de Cúcuta, Colombia, detailing individual property tax contributions. The data likely contains property identifiers, resolution numbers, and application dates for updating the municipal cadastral base. The dataset was last updated on 2026-05-18 and is available via the datos.gov.co platform.
A metadata catalog listing information published by the La Bellezana Cooperative in compliance with Colombian transparency law. The dataset includes columns for information titles, responsible parties, generation dates, formats, and access methods. It is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
Procedural data for traffic and transport services managed by the Road Safety Office of Antioquia. The dataset includes the type of procedure, vehicle type, and registration date. The first version covered procedures carried out during 2021; the second version is updated through December 31, 2024.
A publication schema from the Manizales Chamber of Commerce for Caldas outlines information subject to proactive disclosure under Colombian Law 1712 of 2014. The dataset includes 12 columns such as 'Frecuencia de Actualización', 'Nombre de Responsable de la Información', and 'Formato'. It is published via the Colombian open data platform www.datos.gov.co and was last updated on 2026-05-18.
Administrative acts performed by the city administration of Palmira during the 2025 fiscal year. The dataset includes columns for act number, status, date, issuing department, and topic. It was published on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-26.
The City of Sherbrooke inaugurated nine walls for temporary graffiti and three walls for permanent graffiti on August 13, 2012. This dataset provides the locations of these designated sites, which are identified by plaques instructing artists on usage rules. The data is provided by the Government and Municipalities of Québec under a CC-BY-4.0 license.
Records of citizen participation in Information and Communication Technology (ICT) adoption programs funded by the DigiCampus call and short courses offered by the SETIC department of Valle del Cauca. The dataset includes columns for municipality, academic classification, gender, and program details. It is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
A geospatial dataset mapping flow beds during low water periods or within retention basins in the City of Laval, Québec. The data is provided by the Government and Municipalities of Québec for informational purposes and is updated by the city to reflect its most current information. The dataset is available under a CC-BY-4.0 license and was last updated on April 22, 2026.
TasteBench is a benchmark of real, subjective taste judgements collected by breitburg. It contains aesthetic and creative decisions a real person made where there was no objectively correct answer, such as design direction, typography, curation, and visual style. Each item presents a situation and asks which option the person chose, with models scored against the actual human choice.
Christian L. L. Strauss authored a study exploring the overlap between bifactor and multilevel confirmatory factor analysis models. The 28.4 KB document, last updated on May 1, 2026, uses simulation and empirical analysis to demonstrate that bifactor solutions can emerge as artifacts of unmodeled data clustering. The work encourages researchers to consider multilevel measurement models as alternative explanations for bifactor structures.
A list of beneficiaries of the priority housing subsidy on owned land, allocated to 62 households spread across the urban areas of the Yopal and Aguazul municipalities in the Casanare Department. The dataset includes columns for responsible entity, municipality, beneficiary names, year, zone, allocation resolution, item, and modality. It is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
A geospatial web service containing seabed morphology and geomorphology information for the Beagle Marine Park in south-eastern Australia. The data product was published by Nanson et al. in 2023 (eCat Record 147976) and is served via an OGC Web Feature Service (WFS). It is intended to support marine park management and regulatory activities.
A dataset from datos.gov.co, last updated on 2026-05-18, containing records of preliminary investigations initiated by territorial directorates of the Ministry of Labor in Colombia. The data likely tracks the number of initial fact-finding actions taken by officials to assess the probability of labor law violations. It is provided in CSV, JSON, XML, and RDF formats.
Data from data.cityofnewyork.us details financial plan initiatives and their allocated funding amounts over a five-year period. Dollar amounts are rounded to thousands, and the dataset is updated four times per year following key financial plan publications. The data includes columns for PUBLICATION DATE, FISCAL YEAR, AMOUNT YEAR 1 through AMOUNT YEAR 5, AGENCY NAME, INITIATIVE NAME, and FUNDING.
Municipal-level vaccination percentages for children in the Risaralda department, sourced from www.datos.gov.co. The dataset includes historical data (the time range is not specified), which allows trend analysis of vaccination campaigns. Columns include Indicador, Fuente, Municipio, Unidad de Medida, Dato Numérico, and Año.
ESQUEMA PUBLICACION INFORMACION is a structured catalog from Colombia's open data portal detailing information published by government bodies. The dataset includes columns for language, format, responsible entity, and update frequency, likely serving as a metadata index. It is hosted on the Socrata platform by datos.gov.co and was last updated on May 18, 2026.
A dataset from the Mapa Inversiones platform, last updated on 2026-05-27, detailing entities responsible for executing public investment projects in Colombia. It is provided by datos.gov.co and includes columns for project identifiers and entity names. The dataset is available in multiple structured formats including CSV, JSON, XML, and RDF.