General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
168,492 datasets
Data on sanctions imposed across economic sectors in Colombia, distinguishing between non-executed and executed penalties. The dataset includes the number and monetary value of sanctions in Colombian pesos for each sector. It is hosted on the datos.gov.co platform and was last updated on 2026-05-18.
Mapa Inversiones provides data on the financial execution of public investment resources distributed across Colombia's territorial entities. The dataset is hosted on the Socrata platform by datos.gov.co and was last updated on 2026-05-27. Columns suggest tracking of committed, obligated, and paid values across departments, municipalities, sectors, and funding sources.
Plan de acción en salud del Municipio de Fusagasugá outlines health activities, budgets, and quarterly targets for the 2020 fiscal year. The dataset includes 25 columns detailing planned activities, responsible parties, financial resources, and performance metrics. It originates from the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
An official report on environmental quality published by Japan's Ministry of the Environment for the year 2003. The document is authored by the Ministry's Environmental Planning Office within the General Policy Division. It is available in HTML and PDF formats.
An official schema used by obligated entities in Colombia to systematically report on published and future information under the proactive disclosure principle of Law 1712 of 2014. The dataset lists metadata for information assets, including format, responsible parties, and update frequency. It is hosted by the Colombian open data portal, www.datos.gov.co, and was last updated on 2026-05-18.
PallasBench is a suite of 45 JAX Pallas kernels across 3 difficulty levels, originally designed for TPU and adapted for GPU. The dataset reports that 39 out of 45 kernels pass on an NVIDIA A100 80GB, representing the first GPU-focused evaluation of JAX Pallas kernels. It was authored by EvanOLeary and last updated on May 30, 2026.
An index of information assets classified as confidential or reserved, managed by Colombian public entities in 2023. The dataset includes columns for assessing impact on trust, integrity, confidentiality, legal implications, and data protection compliance. It originates from the Colombian open data portal, datos.gov.co, and was last updated on 2026-05-18.
Disparity in hours worked between men and women in Colombia, measured as the difference in average weekly hours in primary and secondary jobs. The dataset includes yearly values broken down by geographic domain and granularity, sourced from datos.gov.co. Its last recorded update was on 2026-05-18.
Plan Empresarial de Contratación (PEC) for the 2020 fiscal year, published by CENS SA ESP and hosted on the Colombian open data portal. The dataset includes columns such as Objeto de la Contratación (contract object), Fecha Estimada de Inicio Ejecución (estimated start date), and Tipo de Contrato (contract type). It was last updated on 2026-05-18.
A dataset from NVIDIA, last updated on 2026-06-04, designed to improve the coding capabilities of large language models. It contains metadata corresponding to a raw source-code update for the NVIDIA Nemotron 3 family of models. The dataset is part of the Nemotron Pretraining Data collection.
The Australian Communications and Media Authority published anticipatory notices for telecommunications infrastructure. The data covers seven project areas where OptiComm Pty Ltd is contracted to install Fibre to the Premises (FTTP) networks, with notices declared on 7 May 2026 but originally given on 20 September 2022.
Sumideros urbanos, or storm drains, are cataloged by commune, street, and neighborhood in the municipality of Palmira for the year 2021. The dataset is hosted on the Colombian open data portal, www.datos.gov.co, and was last updated in May 2026. Columns suggest a municipal inventory including location, measurements, and sector data.
This dataset contains the ICA's Tablas de retención documental (document retention schedules), updated for the 2022 period. It is hosted by the Colombian open data portal www.datos.gov.co and was last updated on May 18, 2026. It contains columns such as CODIGO, SERIE, SUBSERIE, and TIEMPO DE RETENCIÓN AG, which likely define document series and their mandated retention periods.
Industrial Agent Benchmark v2.2.0 Japanese Canonical Normalization is an open benchmark for evaluating Industrial AI systems and Manufacturing AI assistants. The dataset was created by MSakae and was last updated on June 14, 2026. It is hosted on Hugging Face.
Parks and Protected Areas from the Government of Yukon includes areas set aside for conservation management purposes. The dataset likely contains National Parks, Territorial Parks, and Special Management Areas established under Yukon First Nations Final Agreements. It was last updated on April 17, 2026.
The Department of Transport and Planning provides counts of driver licences transferred to Victoria on a quarterly basis. The dataset is available in CSV format and was last updated on May 12, 2026. It is published under a Creative Commons Attribution 4.0 International license.
Solicitudes de Restitución por Departamento y Municipio shows the number of land restitution claims registered, unregistered, in process, and micro-focused, disaggregated by the department and municipality of the property location. The data is sourced from the STRDAF (Registro de Tierras Despojadas y Abandonadas Forzosamente) as per Article 76 of Law 1448 of 2011. It includes counts of properties and titleholders per claim.
Aggregated historic fire severity classification from 1998 onward for wildfires, reclassified using the Level 2 Classification described in the Post fire Burn Classification Procedure SOP v.1.0 (FEMD, 2014). The dataset includes classifications from FIRE_SEV98 to FIRE_SEV09 and specific Victorian fires from 2013 and 2014, with later fires using Level 3 Classification. It is provided by the Department of Energy, Environment and Climate Action.
Guido Veit published raw data on figshare in June 2026. The dataset likely contains experimental results measuring the functional correction of cystic fibrosis transmembrane conductance regulator (CFTR) gating mutants. The 59.0 KB XLS file is licensed under CC-BY-4.0.
A compilation of information on geological processes and terrain hazards in the Yukon Territory. The data was collected and originally published between 1994 and 1996, based on work carried out in three phases from 1992 to 1995. Contractors performed the work under the supervision of the Environmental Geologist, Exploration and Geological Services Division, Yukon Region, Indian and Northern Affairs Canada.