Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
165,940 datasets
Sectoral division of leisure districts for the city of Saint-Hyacinthe, provided by the local recreation department. The dataset was created using computer-aided mapping and is maintained by the Government and Municipalities of Québec. It was last updated on April 22, 2026.
Supplementary tables from the ACHIEVE project analyzing HIV care outcomes in Tanzania. The data covers 21,448 children living with HIV (CLHIV) with undetectable viral load at baseline and 4,809 CLHIV with detectable viral load at baseline, as of July 15, 2023. It was authored by Amon Exavery and published on figshare under a CC-BY-4.0 license.
A clinical dataset reporting baseline characteristics for patients grouped by rotator cuff tear size. The data, authored by Ala' Hawa, was last updated on May 22, 2026. It is a 9.5 KB Excel file containing continuous variables reported as mean ± standard deviation, and categorical variables as counts and percentages.
Saint-Hyacinthe municipality's evaluation units are represented as polygons in this dataset. The data was created and updated through computer-aided mapping in collaboration with the evaluation department and is published by the Government and Municipalities of Québec. It was last updated on 2026-04-22.
Two article datasets collected from 22 reputable media outlets for the QuotAnaSum system. Each article contains between 1000 and 2000 words, and the data is stored in a 3.5 MB JSON file. The dataset was authored by Shiyu Han and last updated on May 21, 2026.
A linear geospatial layer illustrating lot divisions for properties containing multiple lots in Saint-Hyacinthe. The dataset is maintained in collaboration with the municipal evaluation department using computer-aided mapping. It was last updated on 2026-04-22 and is published by the Government and Municipalities of Québec under a CC-BY-4.0 license.
Evaluation unit registration point - Saint-Hyacinthe is a geospatial point layer containing the centroids of evaluation units from the graphic matrix. The dataset is provided by the Government and Municipalities of Québec and was last updated on April 22, 2026. It includes attributes for identifiers, municipality codes, registration dates, scales, and administrative divisions.
Shivesh Prakash released this 10.1 GB collection of training data, trained models, and associated files for the MHNpath retrosynthetic planning tool in April 2026. The data supports a machine learning framework that prioritizes reaction templates and allows user tuning based on cost, temperature, and toxicity. It includes case studies on complex molecules like dronabinol and benchmarks against established pathways from PaRoutes.
Supplementary material 3 provides velocity time series for observation sites along the Nankai Trough. The 4.3 KB CSV file, authored by Yusuke Yokota, contains data supporting research on decadal slip deficit rates. It was last updated on June 3, 2026.
TRL-Rbench is a standardized evaluation suite for tabular representation learning models. The dataset includes a row_prediction component with 50 OpenML tables and 1.1 million rows across 123 hand-verified targets. It was created by logo-lab and last updated on June 11, 2026.
United Kingdom Continental Shelf data details provisional licence awards for carbon storage. The dataset is provided by the North Sea Transition Authority and includes geospatial information in the WGS84 coordinate reference system. It is available in multiple formats including GPKG, XLSX, CSV, and GeoJSON.
UKCS carbon storage provisional licence awards (ETRS89) details provisional licenses for carbon storage sites in the UK North Sea. The dataset is provided by the North Sea Transition Authority and includes multiple file formats for geospatial and tabular data. Its cross-platform presence suggests it is an important resource for energy and environmental stakeholders.
Raw JSONL manifests contain the results for the AutoArk-AI/ARK-ASR-3B model on the public English short-form splits from the hf-audio/open-asr-leaderboard. The results were generated and scored on June 22, 2026, using the shared Open ASR Leaderboard scorer on a local machine with 8x RTX 4090 GPUs. The dataset was authored by AutoArk-AI and hosted on Hugging Face.
An anticipatory notice from MyPort Pty Ltd for a telecommunications project at 402 Macaulay Road, Kensington VIC. The notice includes an estimated completion date of 31 October 2025, a contract signed date of 4 August 2025, and specifies the network type as FTTB. It was published by the Australian Communications and Media Authority and last updated in May 2026.
A statutory anticipatory notice for telecommunications infrastructure, declared on 11 March 2026. The notice, given on 29 June 2023, covers a project area at 581-587 Mt Petrie Road Mackenzie QLD 4156 for installing Fibre to the Premises (FTTP). The contract was dated 26 June 2023 with an estimated completion date of November 2024.
An anticipatory notice for fiber-to-the-premises (FTTP) network works by MyPort Pty Ltd, declared by the Australian Communications and Media Authority. The project at 14 Kerr Road West Kallangur QLD 4503 has a contract date of 11 November 2024 and an estimated completion date of 30 June 2025. The dataset includes PDF and ZIP MAPINFO files published under a CC-BY-4.0 license.
A 1960 comparison of magnetic field instruments calibrated at Toolangi against a proton precession magnetograph at Woomera. Observations were conducted on June 27th and 28th, 1960 by the Weapons Research Establishment. The record likely contains calibration data and comparisons between different measurement technologies.
Ruiqiong Zhou's study on figshare analyzes 2,058 single frozen–thawed day-6 blastocyst transfers from August 2021 to August 2024. The research investigates the interaction between progesterone exposure duration and blastocyst expansion stage on live birth rates. The dataset likely contains clinical variables and outcomes from this retrospective cohort study.
Sentinel-1A SAR VV Backscatter Quicklooks are medium-resolution (approximately 20m pixel size) radar images processed by CSIRO for the eReefs Phase 5 project. The product is derived from Level 1 IW GRD data with VV polarization, reprojected onto a regular grid and filtered to reduce speckle. It is available as a multi-format geospatial service for environmental monitoring.
RyeAI/ftspeech-pnc-da is a text-only companion dataset for Danish speech processing. It provides restored punctuation and capitalization for utterances from the original FTSpeech dataset, with each row containing an utterance_id for joining back to the source. The dataset was created by RyeAI and last updated on 2026-06-19.