Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
146,448 datasets
A dataset of 597 experimental test results on red sandstone samples, investigating dynamic compressive strength under coupled acidic wetting-drying cycles. Bin Du authored this dataset, which was last updated on May 14, 2026. It was used to develop and validate five ensemble learning models for predicting mechanical behavior in chemically degraded environments.
597 experimental test results on red sandstone dynamic compressive strength under coupled acidic wetting-drying cycles. The dataset was used to develop and validate five ensemble learning models, including Random Forest and XGBoost. Authored by Bin Du and last updated on May 14, 2026.
9.5 KB of correlation coefficients for inhalation amplitude, including means and 95% posterior boundaries. The data is structured by Electromagnetic Articulography (EMA) dimension and sentence type. Tabea Thies published this dataset on figshare under a CC-BY-4.0 license, with a last update timestamp of 2026-05-28.
Myport Pty Ltd's anticipatory notice for a fiber-to-the-premises project at 117 Mina Parade, Alderley QLD 4051. The Australian Communications and Media Authority published this record, which includes a contract date of 20 June 2022 and an estimated completion date updated to February 2025. The notice was originally given on 5 July 2022 and varied on 18 September 2024.
Monthly management information on staff numbers and paybill costs across Civil Service departments, agencies, and executive NDPBs. The data provides headcount and full-time equivalent figures for both payroll and contingent labor, with payroll costs broken down into salaries, allowances, and pension contributions. It is published by the Northern Ireland Office and was last updated on 2026-05-18.
August 2003 to July 2004 field observations, GPS points, and polygons of deforested areas in the Brazilian state of Mato Grosso, between Nova Mutum and Sinop. This dataset was created to validate Moderate Resolution Imaging Spectroradiometer (MODIS) data at 250-meter resolution for deforestation detection. It comprises 16 files, including 10 shapefiles and 6 CSV files.
Typical Utility Bill Information Electric: 2011 from data.ny.gov provides typical average monthly electric bills for residential, commercial, and industrial customers. The dataset likely contains computed bills for standardized usage scenarios across different customer types and seasons. Columns suggest detailed breakdowns of total cost, usage, demand, and load factors for various utility companies.
A dataset from www.datos.gov.co, last updated on 2026-05-18, listing products that have been called for official review by INVIMA, Colombia's national food and drug surveillance institute. It allows citizens and users to check if products they consume or that are marketed in the country are under review for non-compliance with quality, safety, and efficacy standards. The dataset includes columns detailing the reason for the call, the product, the establishment, and the official documentation.
L&I Public Works Apprentice Utilization tracks workforce participation on Washington State public works projects. The dataset is updated daily from weekly certified payroll reports for projects initiated after July 1, 2019. It provides interim data on apprentice hours, minority utilization percentages, and wages across numerous construction trades.
Deviation metrics from a reference RGB midpoint (R=10, G=10, B=10) quantify sensor stability under controlled illumination. The 5.5 KB XLS file, authored by Nadun Salinda and last updated in May 2026, includes mean deviation, standard deviation, and 95% confidence intervals from repeatability testing.
Sea-viewing Wide Field-of-view Sensor (SeaWiFS) imagery captures the southern African region at 4.5-km resolution. The dataset includes Level-1a swaths selected for clarity and some hazy days for contrast, stored in HDF files with 8 spectral bands from 402 to 885 nm. It was produced by NASA as part of the SAFARI 2000 Project.
691 histopathological images extracted from the publicly available LungHist700 dataset, categorized into 151 normal subjects, 280 lung adenocarcinoma subjects, and 260 lung squamous cell carcinoma subjects. The dataset, authored by Nepolian Vailankanni and last updated in April 2026, contains results from a study proposing a novel deep learning approach for lung cancer diagnostics. The proposed method achieved a reported balanced accuracy of 97% on this data.
Nepolian Vailankanni's dataset, published on figshare in April 2026, contains performance metrics comparing a novel machine learning approach to a baseline model for lung cancer diagnosis from histopathological images. The 5.5 KB XLS file likely contains metrics such as balanced accuracy scores derived from experiments on three public datasets, including LungHist700 with 691 images across three categories. The proposed method achieved a 97% balanced accuracy, compared to 86% for the baseline Vision Graph Convolutional Neural Network.
Nepolian Vailankanni published a dataset on 2026-04-27 containing computation time metrics for various machine learning methods. The data, stored in a 5.5 KB XLS file, likely compares the performance of different techniques for lung cancer diagnosis from histopathological images. The research described used images from the LungHist700, LC25000, and TCGA UT datasets.
97% balanced accuracy was achieved by a proposed method for classifying lung cancer from histopathological images. The dataset contains model settings for techniques including Deep Cluster, Bootstrap Your Own Latent, and SimCLR, combined with Minimum Redundancy Maximum Relevance feature selection and a Vision Graph Convolutional Neural Network classifier. It was authored by Nepolian Vailankanni and last updated in April 2026.
New York State's dataset of all currently active real estate appraiser licensees. Each record details an individual licensee's name, unique ID, license type, certification dates, and their associated business information. The data is maintained by the state and updated across multiple platforms, indicating its official status.
Qatar National Library provides a high-resolution digital master copy of manuscript HC.MS.00557 from its Heritage Collection. The 6.1 GB dataset is published under a CC0 1.0 license and was last updated on June 4, 2026. The data is hosted on figshare and links to the full catalog record and digitized manuscript viewer.
A high-resolution digital master copy of manuscript HC.MS.2017.0009 from the Qatar National Library Heritage Collection. The dataset is provided by Qatar National Library and was last updated on June 4, 2026. The original manuscript is a Quranic fragment, or 'raq'a qur'aniyya'.
A 15.2 GB high-resolution digital master copy of a Quran manuscript from the Qatar National Library Heritage Collection. The dataset is hosted on figshare and was last updated on June 4, 2026. It is provided under a CC0 1.0 Public Domain Dedication license.
A high-resolution digital master copy of manuscript HC.MS.2017.0041 from the Qatar National Library Heritage Collection. The 15.1 GB dataset was published by Qatar National Library and last updated on June 4, 2026. It is available under a CC0-1.0 license.