Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
165,105 datasets
A historic record of Land Act Crown grants issued between April 1, 2000 and March 31, 2009. Crown grants are instruments used to permanently convey Crown land under fee simple title, including direct sales, sponsored grants, lease to purchase, and land exchanges. This data was sourced from TANTALIS and used to inform the Ministry of Forests, Lands and Natural Resource Operations Crown Land Indicators and Statistics report.
19-131 Reef sea water temperature data collected by deployed loggers from 12 March 2011 to 17 April 2026. The dataset is provided by the Australian Ocean Data Network and was last updated on 4 June 2026.
Australian Ocean Data Network collected sea water temperature data from loggers deployed at 19-138 Reef. The time series spans from 14 March 2011 to 21 April 2026. The dataset was last updated on 4 June 2026.
A research paper comparing multilayer perceptron neural networks to logistic regression for optimizing steel structure design. The study evaluates different ANN configurations to find effective arrangements for reducing weight, costs, and environmental impact. The work was authored by Amirhossein Ostovar and published on figshare under a CC-BY-4.0 license.
Puntos de Conectividad Itagüí Inteligente Digital maps the exact locations of free community Wi-Fi access points in the Colombian municipality of Itagüí. The dataset is published by the Colombian open data portal www.datos.gov.co and was last updated on May 18, 2026. It likely contains point coordinates and descriptive information for each connectivity hub.
From 23 August 2006 to 07 May 2026, temperature loggers collected sea water data at Lady Elliot Reef on the Great Barrier Reef. The dataset is provided by the Australian Ocean Data Network and was last updated on 4 June 2026.
Australian Ocean Data Network collected this sea water temperature dataset from one or more loggers deployed around the site of 19-159 Reef. The data spans a 15-year period from 20 March 2011 to 20 April 2026. It was last updated on the platform on 4 June 2026.
Medicaid claims data from New York State covering substance use disorder and other services. The profile includes summary figures for recipients, paid claims, and dollars spent, sourced from data.ny.gov. The dataset was last updated on May 22, 2026.
A 2020 procurement plan from a Colombian public entity, published via datos.gov.co. The dataset includes planned acquisitions with details on selection modality, estimated value, and contract duration. The data is informational and does not represent a binding commitment by the state entity.
18 columns track actions executed on treaties presented to the Congress of the Republic. The dataset includes fields such as FECHA DIARIO, LEY NÚMERO, TÍTULO, and ESTADO ACCIÓN. It is hosted by www.datos.gov.co and was last updated on 2026-05-18.
30 GeoTIFF files provide land cover mapping for Xinjiang's oases across six epochs from 2000 to 2024 at 5-year intervals. Each year includes five maps: one annual composite and four seasonal products for spring, summer, autumn, and winter. The dataset was authored by Lu Chen and published on figshare in June 2026.
Service Delivered metrics from the MTA Metro-North Railroad track scheduled versus actual train operations. The data details monthly counts of trains that were Cancelled, Terminated, Scheduled, and Actual across different Service Territories and Lines. It is published by data.ny.gov and was last updated in May 2026.
A 2025 annual procurement plan from the municipality of Santa Rosa De Cabal, Colombia, published on datos.gov.co. The plan lists estimated contract details for goods, works, and services, with the disclaimer that items may be canceled or modified. The dataset was last updated in May 2026.
9.0 KB of data in an XLSX file, showing the overlap between HRGs and LRGs predicted by the MOGT tool on two genomic window sizes. The dataset was authored by Jiafang Li and last updated on 2026-05-29. It is shared under a CC-BY-4.0 license on figshare.
Underlying data used for all analyses in a study by Sorachai Kamollimsakul. The dataset is a 37.3 KB XLSX file, last updated on 2026-05-29. It is published under a CC-BY-4.0 license and is also available on Mendeley Data.
5.5 KB of data in XLS format, uploaded by Takehiro Kosaka to figshare. The dataset contains correlation coefficients between a specific power ratio (50%&70%/0%&10%) and ratios of combinations among other loads. It was last updated on 2026-05-29.
8.6 KB of gene panel data identifies sex-specific biomarkers for bladder cancer. The dataset results from applying four machine learning feature selection methods to gender-stratified RNA-seq data, achieving areas under the ROC curve of 0.932 and 0.914 for male and female panels, respectively. Authored by Joseph R. Pizzi and last updated on 2026-04-29, it is shared under a CC-BY-4.0 license.
A gene panel dataset resulting from a machine learning study on sex-stratified bladder cancer RNA-seq data. The study applied four feature selection methods to identify biomarkers, with male and female-specific panels achieving areas under the ROC curve of 0.932 and 0.914, respectively, on unseen data. The dataset was authored by Joseph R. Pizzi and last updated on 2026-04-29.
A gene panel dataset derived from gender and disease-stratified RNA-seq data to identify sex-specific bladder cancer biomarkers. The data was generated by Joseph R. Pizzi using machine learning feature selection methods and was last updated on 2026-04-29. Male and female-specific panels achieved areas under the ROC curve of 0.932 and 0.914, respectively, for distinguishing cancer from non-tumor controls.
Machine learning techniques identified sex-specific gene panels for bladder cancer diagnosis. Joseph R. Pizzi published this 19.4 KB CSV dataset on figshare in April 2026. Male and female-specific panels achieved AUC scores of 0.932 and 0.914 respectively on unseen data.