Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
158,265 datasets
A 46.5 MB video file showing the second stage of making a basket trap, specifically adding new palm leaves to the rope and tying sticks to the hoop. The footage was compiled by Marie-Annick Moreau from two source recordings. It was last updated on June 3, 2026, and is shared under a CC-BY-NC-SA 4.0 license.
Compiled videos document the first stage of making a basket trap, from carving sticks to tying them onto the top ring. The dataset consists of 80.0 MB of WAV format video files uploaded by Marie-Annick Moreau. It was last updated on 2026-06-03.
An audio recording of Mzee Kulenga discussing the characteristics of vines used for making basket traps. The file is 47.9 MB in WAV format, uploaded by Marie-Annick Moreau and last updated on June 3, 2026. It focuses on the advantages and availability of the mngombe vine species compared to others.
Public educational institutions in San José de Cúcuta, Colombia, that have received free fiber-optic internet connectivity services. The dataset includes columns for school names, addresses, student counts, internet speed, and precise geographic coordinates. It was published by www.datos.gov.co and last updated on 2026-05-18.
ARN (Agencia para la Reincorporación y la Normalización) provides national and regional statistics on habitat conditions, access to public services, and critical overcrowding for individuals in the reintegration process. The dataset includes columns for water, electricity, waste collection, sewerage, and critical overcrowding status, organized by municipality and department. It was last updated on 2026-05-18 and is hosted on the Colombian open data portal www.datos.gov.co.
Demobilized individuals from armed groups like M-19, Quintin Lame, and FARC-EP, recorded under specific Colombian amnesty laws and decrees. The data is structured by type, total count, category, and department of demobilization. It is hosted by the Colombian government's open data portal, www.datos.gov.co, with a last recorded update timestamp of 2026-05-18.
An index of acts, documents, and information classified as confidential or reserved by Colombian public bodies, as mandated by Law 1712 of 2014 (Transparency and Access to Public Information Law). The dataset includes columns for classification rationale, legal basis, responsible officials, and retention periods. It is published by datos.gov.co and was last updated on 2026-05-18.
Risaralda, Colombia, hosts a list of registered companies providing fumigation and pest control services. The dataset includes company names, municipalities, services offered, pest types, methods used, sectors served, and email contacts. It was published on www.datos.gov.co and last updated on 2026-05-18.
A dataset from the MINT Lab study of LLM preference coherence over parametric outcome ladders. It contains 100 validated ladders across 12 value categories, each structured as a 7-tier scale. The dataset was authored by MINTLABJHUANU and last updated on June 17, 2026.
Nanson et al. (2023) produced this geospatial seabed morphology and geomorphology dataset for the Beagle Marine Park in south-eastern Australia. The dataset is published as an ESRI web map service (eCat Record 147976) and is managed by the Australian Ocean Data Network. It was last updated on 2026-06-05.
A log of citizen requests for open data submitted through the 'participa' section of Colombia's datos.gov.co platform. The dataset includes columns for request details, responding entities, and status updates. It is hosted by the www.datos.gov.co organization and was last updated on 2026-05-28.
Forestry harvest data from the Risaralda Regional Autonomous Corporation details approved volumes and tree counts for public and private properties. The dataset includes columns for Municipality, Department, Year of Information, Harvest Class, and Volume. It is provided by the Colombian open data portal www.datos.gov.co and was last updated in May 2026.
2010 to 2024 fetal mortality rates for municipalities in the Bolívar Department, Colombia. The dataset includes annual numerators (fetal deaths), denominators (live births), and calculated rates per 1,000 live births. Data originates from Colombia's National Administrative Department of Statistics (DANE) and was last updated on the Socrata platform in May 2026.
A registry of information assets published by Aguas Regionales EPM under Colombia's 2014 Transparency and Access to Information Law 1712. The dataset includes columns for document typology, format, content description, and responsible producer. It was last updated on 2026-05-18 16:39:00 and is available via the www.datos.gov.co portal.
National and regional information on individuals who entered a reintegration process and received assistance for higher education. The dataset includes columns for municipality, department, educational level, and process status. It is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
Predicted high and low water level heights at the Amrun tide gauge location. The dataset is provided by Transport and Main Roads (Queensland) and was last updated on 2026-05-29. It is available as a CSV file under a CC-BY-4.0 license.
Gold Coast Seaway predicted water level heights at regular time intervals. The data is provided by Transport and Main Roads (Queensland) and was last updated in May 2026. The dataset is available in CSV format under a CC-BY-4.0 license.
The 2024 Nigeria Demographic and Health Survey (NDHS) provides a nationally representative dataset of 36,161 women aged 15–49, harmonized for machine learning analysis. It contains 43 variables on socio-demographics, health access, media exposure, and COVID-19 knowledge, with a binary outcome for vaccine uptake. The data includes sampling weights and engineered features to support predictive modeling of health behavior.
Yopal, Colombia's city hall maintains a registry detailing its classified and reserved information assets. The dataset includes columns for information classification, confidentiality, integrity, responsible personnel, legal justifications, and retention periods. It is published via the Colombian open data portal and was last updated on May 18, 2026.
Data from 40 municipalities in Norte de Santander, Colombia, reported to the Departmental Health Institute and the Secretariat for Drinking Water and Basic Sanitation. The dataset contains the Water Quality Risk Index (IRCA) for public utility companies and cooperative associations. It was last updated on 2026-05-18.