General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
153,444 datasets
Hayato Harima published ANOVA and Tukey test results for viral titers on 2026-05-26. The 15.7 KB Excel file contains statistical analysis of recombinant virus infections in Vero E6 cells measured at 72 hours post-infection.
A dataset showing the distribution of microalbuminuria prevalence across quartiles of the Waist-to-Height Ratio (WHtR). The data includes 95% confidence intervals and is provided in an XLS file. It was authored by Xia Huang and last updated on 2026-05-19.
Australian Communications and Media Authority published anticipatory notices for Opticomm Pty Ltd infrastructure projects. The data includes two project areas with estimated completion dates, addresses, and geographic coordinates. The notices were given on 31 May 2025 and partly declared on 29 April 2026.
A directory of registered vehicle mechanical workshops operating in the Huila Department of Colombia. The dataset includes location and contact information for workshops, compiled from public records maintained by the Huila Chamber of Commerce. It was last updated on 2026-05-18.
A 40.9 KB Excel file provides biomass data for dominant grassland species. Zhening Zhu published the dataset on figshare in June 2026. It likely contains aboveground and belowground biomass measurements for the top four species at each sampling site.
Colombian municipal and district administrations present initiatives for potable water and basic sanitation investment projects in their development plans. The dataset includes columns for project description, estimated value, funding sources, and study status. It was issued on 2024-08-02 and last updated on 2026-05-18 18:45:35 via the Socrata platform.
Mixed Beverage Tax revenue distributions are tracked for cities and counties in Texas. The dataset likely contains monthly or periodic reports detailing tax payments, year-to-date totals, and comparisons to prior periods. It is published by the City of Austin and appears on multiple data platforms.
Council spending data published monthly by the Government Digital Service. The dataset covers transactions from April 2016 onward and is available in multiple formats including CSV, XML, and JSON. The description notes that publication timescales may have been affected during the COVID-19 pandemic.
12,000 feet of marine conglomerate, sandstone, limestone, and shale are described for the Upper Devonian and Carboniferous platform sequence. The data, provided by the Australian Ocean Data Network, details a geological formation disconformably overlain by 350 feet of terrestrial sandstone. The metadata's last-updated field gives 2026-06-05.
5.3 GB of fitted model parameters from a computational neuroscience study on inter-area brain dynamics. The dataset, authored by Mitra Javadzadeh, supports the findings of a 2024 bioRxiv preprint on dynamic consensus-building between neocortical areas. It was last updated on May 17, 2026.
Members of the Risaralda Regional Competitiveness Commission in Colombia, including their affiliated entities and contact information. The dataset is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18. It lists individuals with their associated companies, positions, and municipality-level entity details.
20 comparative tests validated an ESP32-based weighing device for measuring cleaning agent use in hospital sterile supply departments. The code likely contains the firmware and application logic for the device, which was tested over 107 cleaning cycles. Wei Zheng authored this dataset, which was last updated on April 10, 2026.
An ArcGIS InstantApp from the City of Hobart replaces a legacy WebAppBuilder tool for public navigation. The dataset's primary function is to provide a digital interface for locating and interacting with physical wayfinding signage. Its cross-platform presence on data.gov.au suggests it is an official, maintained resource for the city.
LockerNYC is a pilot program operated by GoLocker for the City of New York, providing secure public lockers for package delivery and pickup. The dataset records actions like receiving, reserving, and withdrawing packages from sidewalk lockers across the city. Columns suggest detailed tracking of delivery and pickup durations, locker locations, and associated geographic and administrative boundaries.
50 individual mussels of the species *Mytilus edulis* and *M. galloprovincialis* are represented in this dataset. It provides the number of byssal threads and the corresponding tenacity for individuals that are either non-infested or infested by endolithic symbionts. The dataset was authored by Laurent Seuront and last updated on 2026-05-29.
Consolidated list of entities subject to control by the Departmental Comptroller's Office of Huila, Colombia. The data includes contact details and location for public entities and private legal or natural persons managing municipal resources. The dataset was last updated on 2026-05-18 16:55:52 and is hosted on the Colombian open data portal.
Bee occurrence and trait data collected in 2023 from communities along a 2,957-meter elevation gradient in the Colombian Andes. The data was used in the 2026 publication 'Tropical bee assemblage diversity decreases with elevation while body size increases' by Turley et al. in Biotropica. It was authored by Nash Turley and shared under a CC-BY-4.0 license.
Hydrocoherent numerical terrain models provide a regional representation of Quebec's relief based on altimetric and planimetric data. The models are a collaborative product from the Ministry of Natural Resources and Forests and Natural Resources Canada, offering a quality portrait of relief at a 1:50,000 scale. They feature a grid resolution of 0.324 arcseconds, corresponding to approximately 10 meters on the ground.
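The stated correspondence between 0.324 arcseconds and roughly 10 meters can be checked with a short calculation. The sketch below is illustrative only: it assumes a spherical Earth with a mean radius of 6,371 km (a value not given in the dataset description), and the function name `arcsec_to_meters` is hypothetical.

```python
import math

# Assumption: spherical Earth with mean radius 6,371 km (not from the dataset description).
EARTH_RADIUS_M = 6_371_000


def arcsec_to_meters(arcsec, latitude_deg=None):
    """Convert an angular grid spacing in arcseconds to ground distance in meters.

    With latitude_deg=None the result is the north-south spacing.
    With a latitude, the result is the east-west spacing at that latitude,
    which shrinks by cos(latitude).
    """
    angle_rad = math.radians(arcsec / 3600.0)  # arcseconds -> degrees -> radians
    meters = EARTH_RADIUS_M * angle_rad        # arc length = radius * angle
    if latitude_deg is not None:
        meters *= math.cos(math.radians(latitude_deg))
    return meters


# North-south spacing for the 0.324 arcsecond grid: roughly 10 m.
print(arcsec_to_meters(0.324))
```

Under the same assumption, the east-west spacing is smaller at Quebec's latitudes, since meridians converge toward the pole; passing a `latitude_deg` value shows the effect.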
A dataset of 93 patients with sellar region brain tumors, including 40 Langerhans cell histiocytosis (LCH) and 53 germ cell tumor (GCT) cases, collected between April 2012 and April 2024. Radiomics features were extracted from multiparametric MRI scans (T1WI and T2WI) with manually segmented regions of interest. The data was used to develop and validate machine learning models for tumor classification.
A gene expression signature model for lung adenocarcinoma prognosis, constructed using LASSO, XGBoost, and Random Forest algorithms. The dataset was created by Guannan Wang and last updated on May 1, 2026. It integrates single-cell RNA-seq data and includes experimental validation of the core gene PABPC1.