Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
147,059 datasets
From 2018 to April 2026, this database contains all jurisdictional conflicts presented to the Constitutional Court of Colombia. The data was last updated on May 4, 2026, and is provided by the platform www.datos.gov.co. It includes case file details, subject matter, and filing dates.
NASA's EPOXI mission acquired raw narrow-band filter images of Mars on 20-21 November 2009. The High Resolution Visible CCD (HRIV) captured images across seven wavelengths from 350 to 950 nm during a 24-hour observing period. This data set was collected to characterize Mars as an analog for extrasolar planets.
A dataset of 73 sepsis patient records from an emergency department, used to construct a multilayer perceptron model for predicting 28-day mortality. The model was validated on independent MIMIC-III and MIMIC-IV datasets, achieving an accuracy of up to 92.0% and an AUC of 0.812. The data was authored by Qi Yun Gan and last updated in April 2026.
Monthly 1-degree gridded data provides estimates of water vapor in the marine atmospheric boundary layer beneath uniform cloud fields. The dataset is derived by combining total column water vapor from AMSR-E and AMSR-2 microwave radiometers with above-cloud water vapor from MODIS near-infrared imagery. This difference method isolates the vapor between the ocean surface and cloud top for analysis of lower atmospheric conditions.
Historical records of the Water Quality Risk Index for Human Consumption (IRCA) for the municipality of San Pedro de los Milagros, Colombia, from 2011 to 2023. The data includes both rural villages with aqueducts and the urban zone, detailing risk levels and water suitability for human consumption. It was published on the Colombian open data portal www.datos.gov.co to promote citizen oversight and informed decision-making for water service improvement.
A PDF data sheet describes a computational neuroscience study investigating pulvinar-inspired long-range skip connections in convolutional vision models. The author is Narmin Zarinabadi, and the file was last updated on May 14, 2026. The dataset is 539.6 KB in size and is licensed under CC-BY-4.0.
A research document describing experiments with a pulvinar-inspired long-range skip pathway in a hierarchical convolutional vision model. The model was evaluated on CIFAR-10 categorization and a near-threshold contrast-detection task with noisy backgrounds. The document was authored by Narmin Zarinabadi and last updated on 2026-05-14.
Annual reports from the Community Development Fund list successful recipients and the amounts awarded each year. The data is provided by the Government of Yukon and was last updated on June 3, 2026. The reports are available in ZIP and HTML formats under the OGL-CA-2.0 license.
Yukon wholesale liquor price lists for products carried by the Yukon Liquor Corporation. The data is published by the Government of Yukon and was last updated on June 3, 2026. These prices are intended for use by licensees only.
Measurements of voltage and current passing through a network of bistable resistive elements with negative differential resistance. The dataset is 13.3 MB in size and was authored by Lauren E. Altman. It was last updated on May 29, 2026.
Eight cities provided the geographic scope for this dataset of model fit statistics from a confirmatory factor analysis. It contains results from Phase 3 of a survey involving 3,791 respondents. The dataset was authored by Sheela S. Sinharoy and last updated in June 2026.
Academic year 2017-18 was the first award year for the Excelsior Scholarship program. This dataset shows the number of award recipients and award dollars by college for the program. The data is provided by data.ny.gov and was last updated on 2026-05-22.
A web service provides access to footprints for multiple imagery types covering New South Wales, Australia. It includes footprints for LANDSAT satellite imagery, standard 50cm orthorectified imagery, and high-resolution 10cm town imagery, plus other projects captured by AAM and Jacobs. The service is updated periodically when new imagery becomes available and is compliant with NSW FSDF specifications.
Budget execution data for the Municipality of Fusagasugá in 2024 details the expenditure process through commitments and payment orders. The dataset includes 28 columns tracking financial flows, such as ApropiacionDefinitiva, Obligaciones por pagar, Pago, and Compromisos. It originates from the Colombian open data portal, www.datos.gov.co, and was last updated in May 2026.
10,142 structured records from the Samuel & Audrey Media Network, created as part of Project 23, a long-term documentation effort focused on Argentina's 23 provinces. The archive contains 164 blog posts, 695 YouTube transcript records, 9,247 photography metadata rows, and 24 media reference records. Records are designed to support research, retrieval, and analysis of Argentina-focused travel documentation.
153 Parquet files contain English dialog and conversation data, totaling approximately 72.8 GB. The dataset was uploaded by user zuhri025 to Hugging Face and was last updated on May 31, 2026. Its structure suggests it is designed for efficient loading of large-scale text data.
A 2026 study by Peizhi Zhang integrates morning urine organic acid and inorganic ion profiles from 470 participants (232 calcium oxalate stone formers and 238 healthy controls). The dataset was used to develop a machine learning-based predictive model for nephrolithiasis, identifying five key urinary metabolites as potential biomarkers. The data is available as a supplementary document under a CC-BY-4.0 license.
A retrospective study of 710 patients (971 lesions) with clinically node-negative T1–T2 papillary thyroid carcinoma. The dataset integrates pathological, ultrasound features, thyroid function, and systemic inflammatory indicators to predict central lymph node metastasis. The model was developed by Yalin Zhu and last updated in April 2026.
Yalin Zhu's study presents a dataset for predicting central lymph node metastasis in clinically node-negative T1–T2 papillary thyroid carcinoma. The dataset integrates pathological, ultrasound features, thyroid function, and systemic inflammatory indicators from 710 patients (971 lesions). The model was developed using a gradient boosting decision tree and validated on an independent temporal cohort.
A retrospective study of 710 patients (971 lesions) with clinically node-negative T1–T2 papillary thyroid carcinoma, integrating pathological, ultrasound, thyroid function, and systemic inflammatory indicators. The dataset was used to develop an interpretable gradient boosting decision tree model for predicting central lymph node metastasis, achieving an AUC of 0.830 in the test set. The model was authored by Yalin Zhu and last updated on 2026-04-27.