Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
169,153 datasets
Ten months of monitoring data for carabid beetles on Takapourewa, New Zealand, collected via pitfall traps and ACOs. The dataset was authored by Mark Anderson and is available as a 323.1 KB XLSX file under a CC-BY-4.0 license.
Signalling questions from the RoB 2 tool for a meta-analysis comparing early versus delayed anticoagulation in patients with non-valvular atrial fibrillation-related acute ischemic stroke. The dataset is a 2.4 MB XLSM file authored by Xiaodan Zhang and last updated on 2026-05-29. It is licensed under CC-BY-4.0 and hosted on figshare.
SEGUIMIENTO PLAN INDICATIVO is a monthly monitoring report for the indicative plan of the municipality of Manizales, Colombia, with a cut-off date of November 2025. The dataset was published by the Secretariat of Planning in compliance with the municipal development plan for the 2024-2027 period. It tracks program performance through indicators, targets, and normalized compliance metrics.
An open, daily-updated AHR999 Bitcoin hoarding index dataset self-computed from Binance BTCUSDT daily closes. The dataset is converted to TsFile format and mirrored from its original GitHub repository. The canonical dashboard and data endpoints are provided by the original author.
Topographical maps at a scale of 1:25,000 were produced for military purposes and classified as Confidential. The dataset includes variants for topographic maps (TK) and topographic city maps (TSP). It was published by the Bundesamt für Kartographie und Geodäsie, with a last recorded update in 1985.
A dataset created using the LeRobot framework for robotics applications. The data likely contains action commands for a 6-degree-of-freedom robot arm, as suggested by the feature names. It was authored by VEXAutoSort and last updated on June 11, —.
A 9.5 GB dataset containing in situ and simulated event data associated with the UEP manuscript. The data was authored by Yasuhito Hayashi and is available under a CC-BY-4.0 license. It was last updated on June 2, 2026.
ONSPD MAY 2015 csv V2 is a postcode directory published by the Office for National Statistics. The dataset likely contains geospatial and administrative data for UK postcodes as of May 2015. Its specific content and scale require verification after download.
A password-protected bundle of public deep-research benchmarks used by AgentHarness to evaluate the Apodex-1.0 AI model in standard ReAct mode. The dataset was authored by 'apodex' and last updated on 2026-06-08. Its specific contents and scale are not detailed in the provided metadata.
Bundesamt für Kartographie und Geodäsie produced this topographic map series at a scale of 1:25,000. The maps were primarily for military purposes and originally classified as Confidential. The series includes variants for general topography (TK) and city maps (TSP).
The Government of Nova Scotia provides a dataset listing predominant and traditional family names of African Nova Scotians across six regions of the province. It includes over 48 traditional communities, mapping family names and communities to specific geographic areas. The data supports cultural, historical, and demographic research related to African Nova Scotian heritage.
A geospatial point dataset shows the locations of tables in the Australian Capital Territory. Assets are owned or managed by the City Services, City and Environment Directorate and Parks and Conservation Service. The dataset was last updated on April 4, 2026.
Metabolomic raw data from a project investigating the IbADT6 gene's role in sweet potato. The 474.7 KB dataset includes R scripts and Excel files, published under a CC-BY-4.0 license by rui pan. It was last updated on May 30, 2026.
A historical archive preserving the Nomadic Samuel Top 100 Travel Blogs ranking from the early-to-mid 2010s. The dataset includes a final composite ranking, metric-specific ranking tables, blog entity records, and methodology context. It was created by samuelandaudreymedianetwork and last updated on Hugging Face in May 2026.
Spell Correction RU is a dataset for training models to correct spelling, punctuation, and case errors in Russian text. Each example is a pair of correct text and text with errors. The dataset was used to train the model melsmm/Spell-Corrector-RU-4B.
A list of innovative educational spaces equipped with advanced technology (ICT) to transform teaching. The dataset is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-25. It includes columns for grade level, device type, educational institution, classroom, brand, campus, and model.
Approved project allocations and contributions for Country-Based Pooled Funds in Kenya. The dataset is provided by the Country-Based Pooled Funds (OCHA) and was last updated on 2026-05-19. It likely contains records of financial distributions and donor inputs for humanitarian projects.
Country-Based Pooled Funds allocations and contributions for Honduras. The dataset contains approved project allocations and the contributions received by each fund. It is published by the Country-Based Pooled Funds (OCHA) and was last updated on 2026-05-19.
El Salvador humanitarian funding data contains approved project allocations from Country-Based Pooled Funds (CBPFs) and the contributions received by each fund. The dataset is provided by the Country-Based Pooled Funds (OCHA) organization and was last updated on 2026-05-19. It is available in CSV format under a CC-BY-4.0 license.
Country-Based Pooled Funds (CBPFs) data for Bangladesh tracks approved project allocations and contributions received. The dataset is published by the United Nations Office for the Coordination of Humanitarian Affairs (OCHA) and was last updated in May 2026. It likely contains financial records for humanitarian funding flows within the country.