Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
168,994 datasets
A high-resolution digital master copy of manuscript HC.MS.02486 from the Qatar National Library Heritage Collection. The 10.5 GB dataset was published by Qatar National Library and last updated on 2026-05-17. It is available under a CC0 1.0 public domain dedication.
A high-resolution digital master copy of manuscript HC.MS.02487 from the Qatar National Library Heritage Collection. The manuscript is titled 'Kitab Al-Filaha' and authored by Ibn Bassal, Abdullah Muhammad bin Ibrahim. The dataset is a 7.4 GB ZIP file published under a CC0 1.0 license.
A high-resolution digital master copy of manuscript HC.MS.02603 from the Qatar National Library Heritage Collection. The 832.0 MB ZIP file contains a digitized manuscript titled 'مجموع مقالات في الإسلام' (A Collection of Articles on Islam). Qatar National Library published this dataset under a CC0-1.0 license, with a last update timestamp of 2026-05-17.
Quarterly records from 2020 of public entities registered in Colombia's Public Finance and Information Consolidator System (CHIP). The dataset includes contact details and location information for each entity, sourced from the Colombian open data portal datos.gov.co. The data was last updated on May 18, 2026.
A 6.8 GB high-resolution digital master copy of manuscript HC.MS.02640 from the QNL Heritage Collection. Qatar National Library authored this dataset, which was last updated on 2026-05-17. The dataset provides a direct link to the digitized manuscript in the QNL Digital Repository.
Qatar National Library provides a high-resolution digital master copy of manuscript HC.MS.2016.0069 from its Heritage Collection. The dataset consists of a ZIP file containing two manuscript pages described as having 'samāʿāt' (listening certificates or audition notes). The data was last updated on 2026-05-17 and is published under a CC0-1.0 license.
Muthulakshmi Kirubakaran published a 5.5 KB Excel file on figshare in May 2026. The dataset contains the best validation Dice score per fold from a 10-fold cross-validation procedure, presented as a mean and standard deviation across folds. The data likely originates from a machine learning experiment involving a model trained on 10 tables.
5.5 KB of results from a 5-fold cross-validation procedure, showing the best validation Dice score per fold. The data, published by Muthulakshmi Kirubakaran on figshare, was last updated on May 18, 2026. It is stored in an XLS file format.
A 14.1 GB high-resolution digital master copy of manuscript HC.MS.00871 from the Qatar National Library Heritage Collection. The manuscript is titled 'Al-Durar Al-Nathir 'ala Ajwibat Abi al-Hasan al-Saghir' by Ibn Hilal, Abu Salim Ibrahim bin Hilal Ali al-Ghallali. The dataset was last updated on 2026-05-03 and is provided under a CC0 1.0 license.
Qatar National Library provides a 1.6 GB high-resolution digital copy of a historical manuscript, HC.MS.2021.0003, from its Heritage Collection. The manuscript is a collection of three books by the physician and philosopher Abu Bakr Muhammad ibn Zakariya al-Razi (c. 865-925). The dataset was last updated on May 3, 2026, and is shared under a CC0 1.0 license.
Reiwa 7 (2025) fiscal year report published by the Japan Fair Trade Commission. The report is authored by the General Affairs Division and was last updated on the platform in June 2026. It likely contains statistics and analysis on antitrust enforcement, market competition, and regulatory activities in Japan.
Information to understand the management and rehabilitation of contaminated sites under the responsibility of the Government of Quebec. The dataset is built by departments and agencies listing sites, applying a reference framework, and estimating costs for characterization, rehabilitation, monitoring, and maintenance. It was last updated on 2026-04-17.
Contracts of $25,000 and over awarded by the borough council of Côte-des-Neiges—Notre-Dame-de-Grâce, published annually. The list covers periods from June to May historically, and from January to December since 2017. The dataset is provided by the Government and Municipalities of Québec.
An inventory of information assets managed by the Sogamoso municipality government. The dataset includes details on asset type, responsible office, location, and availability, aiming to provide transparency about public administration resources. It is publicly available in digital-electronic format through the government's open data platform.
Beginning in 2019, this dataset tracks collisions at Metropolitan Transportation Authority (MTA) bridge and tunnel facilities. It is published by data.ny.gov and includes monthly counts of collisions and injury-involved collisions, normalized per million crossings. The dataset was last updated on 2026-05-15.
GEMS created this test dataset to validate data transfer work for the PEGA system. The dataset is published by the Australian Government Department of Climate Change, Energy, the Environment and Water and was last updated on 2026-05-19. Its specific purpose is to test and validate data transfer processes.
Full Load Equivalent (FLE) enrollment data for publicly funded post-secondary institutions in Alberta. The dataset uses CIP 2011 codes to standardize fields of study and provides a comparable metric across different programs and institutions. It is published by the Government of Alberta and was last updated on April 17, 2026.
115 to 310 nm wavelength spectra are provided at a high spectral resolution of 0.1 nm. The dataset contains merged daily solar spectra constructed from the SORCE SOLSTICE FUV and MUV instrument, with irradiances reported at a mean solar distance of 1 AU. Each netCDF file contains variables for date, Julian day, wavelength, irradiance value, uncertainty, and repeatability.
SORCE SOLSTICE Level 3 data provides daily averages of the magnesium II core-to-wing index, a key proxy for solar ultraviolet activity. The dataset is a tabular ASCII file containing calendar date, Julian Day, the core/wing ratio, and its absolute uncertainty for each daily measurement. Spectral resolution of 0.1 nm allows the Mg-II doublet to be fully resolved and modeled with Gaussians.
11.8 MB of geospatial data files published on figshare by liu jiaheng in June 2026. The dataset likely contains shapefiles and models analyzing global zoogeographical divisions for terrestrial vertebrates. Columns suggest it includes spatial boundaries and factors influencing these biological regions.