Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
157,441 datasets
19.9% of 221 cervical cancer patients developed symptomatic pelvic lymphocele after pelvic lymphadenectomy at a Chinese hospital between January 2023 and September 2024. Yiyue Wang created this dataset to develop interpretable machine learning models, with the K-Nearest Neighbors model achieving an AUC of 0.952 on the training set. The data includes clinical characteristics and laboratory results used to identify key predictive features like diabetes and tumor size.
Table 1_An interpretable machine learning model for predicting symptomatic pelvic lymphocele after pelvic lymphadenectomy in cervical cancer.xls contains clinical and laboratory data from 221 patients with cervical cancer. The data was collected at the Affiliated Hospital of North Sichuan Medical College between January 2023 and September 2024. Author Yiyue Wang published the dataset on figshare under a CC-BY-4.0 license.
19.9% of the 221 cervical cancer patients studied developed symptomatic pelvic lymphocele after surgery. The dataset contains clinical characteristics and laboratory data from a retrospective analysis at the Affiliated Hospital of North Sichuan Medical College, collected between January 2023 and September 2024. It was used to develop and validate interpretable machine learning models for predicting this surgical complication.
Kexin Zhang published a 6.6 MB dataset on figshare in 2026 to support manuscript reproducibility. The data includes processed numerical simulation and experimental results for structural health monitoring of offshore wind turbine towers. It contains finite element modal frequency data, axial crack indicators, annular crack indicators, flange bolt fracture indicators, experimental frequency data, and experimental SSMR indicator data.
A directory of EPS, IPS, and health centers in Yopal Municipality, Colombia, updated as of May 2026. The dataset includes basic contact information and geospatial coordinates for each entity. It is published by the Colombian open data platform www.datos.gov.co.
Potential water supply volumes in megalitres per day from various resource options under different future scenarios. The data, provided by the UK Environment Agency, models water availability at national, regional, and company scales for the 2050s. It includes projections for 'Do Nothing', 'Low', 'Central', and 'High' national framework scenarios.
26 columns suggest a detailed record of municipal code enforcement actions, including case status, inspections, and hearings. The dataset includes geospatial coordinates (XPos, YPos) and property identifiers (GeoPIN), linking violations to specific locations. It is published by the City of New Orleans and appears on multiple government data platforms.
Geoscience Australia Data presents seabed bathymetry compilations for the Australian Antarctic margin, derived from multibeam, singlebeam, and ETOPO2 satellite data. The flythrough reveals complex seabed environments, particularly off the Davis coastline, and includes images of seabed communities for the George V margin and Davis coastline. This data was last updated on 2026-05-14.
A curated dataset for binary text classification focused on identifying Moroccan Darija (Arabic script or Arabizi) versus other dialects, Modern Standard Arabic, or languages like English. The dataset was built from multiple public sources, cleaned, and taxonomized by author atlasia. It was last updated on 2026-06-07.
Sudip Saha published a dataset comparing recent state-of-the-art adversarially robust Network Intrusion Detection System (NIDS) models. The dataset is a 5.5 KB Excel file, last updated on June 1, 2026. It is shared under a CC-BY-4.0 license on the figshare platform.
A 2024–2026 summary of research on adversarially robust Network Intrusion Detection Systems (NIDS) and defense approaches. The dataset is a 9.5 KB Excel file authored by Sudip Saha and shared under a CC-BY-4.0 license on figshare. It was last updated on June 1, 2026.
As of April 2026, this dataset lists all cannabinoid hemp licenses issued by the New York State Office of Cannabis Management. It provides details on licensed entities, their locations, license status, and validity periods. The data is available in multiple formats and is hosted on official state data platforms.
NASA's dataset provides 30-meter resolution maps of Cajander larch aboveground biomass circa 2007 and perimeters for 116 forest fires from 1966-2007. The data covers approximately 100,000 square kilometers of the Kolyma River Basin in northeastern Siberia, Russia. It combines biomass estimates with a multi-decadal fire history for a key boreal forest region.
Lower Cretaceous microfossils recovered from a bore on the Plenty River. Their presence demonstrates a westerly extension of the marine Lower Cretaceous below the sand cover of the Simpson Desert. The dataset is provided by the Australian Ocean Data Network.
SIVIGILA 2018 contains records from Colombia's National Public Health Surveillance System, designed to provide systematic and timely information on events affecting population health. The dataset includes columns such as ANO (year), SEMANA (week), COD_EVE (event code), and conteo_casos (case count). It is hosted by datos.gov.co and was last updated on 2026-05-18.
Bushcare Groups data is published by the City of Hobart Open Data platform. The dataset likely contains information on community-led environmental volunteer groups active in the Hobart area. Its availability in multiple geospatial formats suggests it can be used for mapping group locations and activities.
2-meter contour data for the City of Hobart, Australia, provides detailed topographic elevation information. The dataset is published by the Hobart City Council's GIS team and is available in multiple geospatial formats. Its cross-platform presence indicates it is a maintained public resource for local geospatial analysis.
Hobart City Council provides geospatial data on designated off-leash dog exercise areas within the municipality. The dataset is published as open data and is available in multiple formats, including CSV and GeoJSON. Its primary purpose is to inform residents and visitors about locations where dogs can be exercised.
Drinking Fountains View is a geospatial dataset listing public drinking fountains in the City of Hobart, Australia. The data is provided by the Hobart City Council as part of its open data initiative. It is available in multiple formats, including GeoJSON, CSV, and KML, suggesting it contains location coordinates and likely attributes for each fountain.
Contour 0.5m data provides detailed elevation information for the City of Hobart, Australia. The dataset is maintained by HCCGISICT and is available in multiple geospatial formats, including GeoJSON, KML, and via an ArcGIS REST API. Its cross-platform presence on data.gov.au suggests it is a core geospatial resource for the region.