General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
168,146 datasets
A 3.1 GB high-resolution digital master copy of manuscript HC.MS.01618 from the Qatar National Library Heritage Collection. The dataset, released under a CC0-1.0 license, contains the twenty-ninth part of the Quran. The file is available as a ZIP archive and was last updated on May 20, 2026.
A 46.5 GB high-resolution digital master copy of manuscript HC.MS.02642 from the Qatar National Library Heritage Collection. The manuscript is titled 'Tafsir Al-Baydawi' and authored by Abdullah bin Umar Al-Baydawi. It was published on figshare by Qatar National Library under a CC0 1.0 license.
Qatar National Library provides a high-resolution digital master copy of manuscript HC.MS.01722 from its Heritage Collection. The dataset is a single 11.2 GB ZIP file containing the digitized manuscript. It was last updated on 2026-05-20 and is released under a CC0 1.0 public domain license.
Reprocessed Level 2 non-time critical geophysical data from the Sentinel-6A Michael Freilich spacecraft's Advanced Microwave Radiometer, with a 60-day latency. The dataset includes surface type, wind speed, water vapor, brightness temperature, sigma0, and wet troposphere measurements, interpolated to correspond with Poseidon-4 SAR altimetry intervals to supply environmental corrections. This product is analogous to the Jason-3 GDR geophysical data record.
The 2023-24 financial year register lists all active and compliant contracts valued at £10,000 or more for the London Borough of Barnet. It provides a snapshot of the council's procurement and financial commitments, with contracts live at the time of publication. Some listed contracts may have expired before subsequent quarterly updates.
2020 data tracking municipal investment projects in La Paz, Santander, Colombia. The dataset includes project descriptions, values, durations, responsible agencies, and contractors. It was published via the Socrata platform on datos.gov.co and was last updated in May 2026.
OSNI's smallest-scale raster product provides a static image overview of Northern Ireland's natural environment. Published as open data by the Government Digital Service under the OGL-UK-3.0 license. The data is distributed in HTML, ZIP, and JSON formats.
A 1976 observer's report from the Vema cruise 33 leg 4, focusing on the Naturaliste Fracture Zone. The dataset is a legacy product published on data.gov.au by the Australian Ocean Data Network. It likely contains textual observations and findings from the cruise conducted between 23 February and 15 March, 1976.
ProCUA-SFT is a large-scale synthetic trajectory dataset for training computer-use agents (CUAs). It was created by NVIDIA and last updated on June 12, 2026. The dataset is designed for supervised fine-tuning of screenshot-based desktop agents that operate graphical environments using mouse, keyboard, and code-like actions.
Flux estimates for carbon dioxide and methane from two interfluvial wetland sites in Brazil's upper Negro River basin. The dataset provides daily and monthly calculations for diffusive and ebullitive fluxes across dry, seasonally flooded, and permanently flooded areas from February 2005 to January 2006, with hydrologic measurements extending from April 2004. Data was produced by ORNL_CLOUD using field measurements and synthetic aperture radar analysis from Radarsat images.
A high-resolution digital master copy of manuscript HC.MS.2020.0021 from the Qatar National Library Heritage Collection. The manuscript is titled 'Kitab Salat al-Qadiriyya al-Sufiyya' and is attributed to al-Sudi, Abd al-Hadi, dating to approximately 1525 or 1526. The dataset was published by Qatar National Library under a CC0 1.0 license.
Information on Indigenous Communities with a cut-off date of December 31, 2020. The dataset is provided by www.datos.gov.co and was last updated on the platform in May 2026. It contains columns for municipality, department, and community names.
A 1:1,000,000 scale raster map provides a static image of county boundaries in Northern Ireland. Published by the Government Digital Service for OpenData, this dataset is the smallest-scale raster product from OSNI, offering an overview of the region. The data is available under the OGL-UK-3.0 license.
A 1.2 GB high-resolution digital master copy of manuscript HC.MS.01004 from the Qatar National Library Heritage Collection. The manuscript is titled 'Al-Jami' Al-Sahih min al-Sunan' (The Authentic Compilation of the Prophetic Traditions), part 22, authored by Muhammad ibn Isma'il al-Bukhari (810-870). The dataset was last updated in May 2026 and is shared under a CC0-1.0 license.
Qatar National Library provides a 4.4 GB high-resolution digital master copy of manuscript HC.MS.2020.0029. The manuscript, titled 'The Muhammadan Method in Advising Humanity', was authored by Muhyi al-Din Muhammad ibn Bir 'Ali ibn Iskandar (circa 1522-1573) and dates to 1585. The dataset was last updated on 2026-05-06 and is shared under a CC0-1.0 license.
CTD profile measurements of temperature and salinity were collected at a single monitoring station in the Derwent Estuary. The Australian Ocean Data Network provides this data covering a six-month period from August 2012 to January 2013. Data is available in formats including NetCDF, which is commonly used for scientific environmental data.
Bering Sea coastal communities in Alaska are the focus of this dataset documenting Yup'ik place names and environmental knowledge. It contains more than 3,000 identified names for features like camp sites, rivers, rocks, and underwater channels. The data also includes Yup'ik perspectives on the importance of place names, land, values, and language.
The Injury/Illness Summary - Operational Source Data (Form 55) dataset contains monthly raw operational data submitted by all railroads to the Federal Railroad Administration. It includes metrics such as train miles, employee hours worked, yard switching miles, passenger counts, and passenger miles. The dataset is maintained by the Department of Transportation and replaced a legacy download system.
Australian spectral data from the Coringa Herald region, hosted in the National Spectral Database Aquatic Library. The dataset supports remote sensing for mapping and change detection in tropical marine protected areas. It is cited in peer-reviewed research on enhancing coral detection under varying water conditions.
A 14.9 GB high-resolution digital copy of the manuscript 'Kitab-i Marifetname' from 1824, authored by Erzurumlu Ibrahim Hakki (1703-1780?). The dataset is provided by Qatar National Library under a CC0 1.0 license and was last updated on 2026-05-06. It is a digitized master copy of a historical manuscript from the library's heritage collection.