Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
150,784 datasets
UNICEF data measures the percentage of children born in the last 24 months who were fed exclusively with breast milk for the first two days after birth. This indicator is relevant for monitoring child nutrition and public health outcomes globally. The dataset is available in CSV and XML formats under a Creative Commons license.
CIPHER encoding matrix maps genes to 18-bit codes via probe counts. The dataset is 920.6 KB in size and was authored by Zachary Hemminger. It was last updated on June 4, 2026.
Gene Ontology enrichment terms for genes assigned to each bit of a final CIPHER encoding matrix. The dataset is a 2.8 MB CSV file authored by Zachary Hemminger and last updated on June 4, 2026. It is shared under a CC-BY-4.0 license on the figshare platform.
SWOT Level 1B data provides interferograms from nine distinct Doppler beams, corrected for on-board processing phase biases. Each netCDF-4 file contains gridded, spatially averaged measurements and geometry data for a full satellite swath per half-orbit. This dataset is produced by NASA's On Board Processor and corrected on the ground for use in advanced hydrological and oceanographic analysis.
Australian iron ore resource estimates compiled by Geoscience Australia. The data includes approximately 400 million tons of demonstrated reserves, 200 million tons of inferred reserves, and a large tonnage of latent resources. Production has historically come from the Middleback Ranges in South Australia and Yampi Sound in Western Australia.
Ali Alqazzaz published this dataset on 2026-04-17. It contains results for the MedDefender-MHAN explainable multi-head attention network, an intrusion detection system designed for healthcare Internet of Things (IoMT) environments. The data likely includes performance metrics from evaluations on the CICIDS2017 and TON_IoT benchmark datasets.
Ali Alqazzaz published ablation study results on the CICIDS2017 benchmark dataset on April 17, 2026. The 5.5 KB Excel file contains results from evaluating the MedDefender-MHAN explainable multi-head attention network for intrusion detection in healthcare IoT environments. The model achieved a reported detection accuracy of 99.47% on CICIDS2017.
Ali Alqazzaz published a dataset on 2026-04-17 containing multi-dimensional comparison results for the MedDefender-MHAN intrusion detection model. The data, stored in an XLS file of 5.5 KB, presents evaluation metrics from tests on CICIDS2017 and TON_IoT benchmark datasets. It includes reported detection accuracies of 99.47% and 98.92%, inference latency, throughput, and explainability alignment scores.
Ali Alqazzaz published a dataset on figshare in April 2026. The dataset likely contains performance metrics for intrusion detection system models, specifically focusing on runtime efficiency. It is associated with research on MedDefender-MHAN, an explainable multi-head attention network for healthcare IoT threat detection.
Per-class detection performance metrics for the CICIDS2017 cybersecurity benchmark dataset, as evaluated in a 2026 study. The dataset, authored by Ali Alqazzaz and shared on figshare, likely contains accuracy or other classification metrics for different attack types. It is a small Excel file (5.5 KB) used to validate the MedDefender-MHAN explainable AI model for healthcare IoT security.
Ali Alqazzaz provides a summary of datasets used to evaluate the MedDefender-MHAN explainable intrusion detection system for healthcare IoT. The summary, last updated in April 2026, is a 5.5 KB Excel file. It references evaluation results on the CICIDS2017 and TON_IoT benchmark datasets.
Cycle threshold (Ct) values and descriptive statistics for 7 candidate endogenous referent genes assayed across 14 samples from Leucosolenia corallorrhiza tissues. The dataset is provided by author Kseniia V. Skorentseva and was last updated on 2026-05-21. It is a small, 5.5 KB Excel file shared under a CC-BY-4.0 license.
Marine dolostones from the Todd River Dolomite and Mount Baldwin Formation preserve a limited archaeocyathan-radiocyathan fauna. The dataset documents fossil species including Aldanocyathus greeni Kruse sp. nov. and Aruntacyathus toddi Kruse gen. et sp. nov. This data from Geoscience Australia correlates these central Australian faunas with South Australian limestone formations and Siberian stages.
ICFES's Information Assets Registry inventories all information, published records, and records available to the public, establishing a guarantee of access to published information. The dataset includes columns describing content categories, availability, preservation media, publication location, and format. It was last updated on 2026-05-18.
Version 2 data from 2008, updated with a batch of tiles in 2012, identifies 167 landscape areas as polygons attributed with geological names related to mass movement. The British Geological Survey (BGS) created this data at a 1:25,000 scale, covering selected 'classic' geology areas like Llandovery, Coniston, and the Cuillan Hills. It includes deposits that have moved downslope (landslips) as well as foundered strata where ground has collapsed due to subsidence.
Mass movement version 7 identifies landscape areas across Great Britain attributed with types of mass movement, such as landslips. The data covers onshore England, Wales, Scotland, and the Isle of Man at a 1:50,000 scale and is provided by the British Geological Survey (BGS). It includes foundered strata not described in the standard rock classification scheme, but caution is advised as historical recording may be incomplete and the landscape is dynamic.
Magdalena Department in Colombia provides data on the number of students enrolled in official and private schools across its municipalities from 2010 to 2020. The dataset excludes the municipalities of Santa Marta and Ciénaga. It includes columns for official, private, and contracted schools, as well as subregion and municipality codes.
Weiren Wang published a dataset of annotated stress-strain curves on figshare in June 2026. The dataset includes figure titles, axis labels, sample identifiers, and point coordinates, and is packaged in a 64.5 MB RAR file. It is shared under a CC-BY-4.0 license.
21.5 KB of data on the frequency of current use of specific wearable devices among users, shared by André Hajek on figshare. The dataset was last updated on June 2, 2026, and is available under a CC-BY-4.0 license in XLS format.
Summary statistics for geometric characteristics and simulation outputs, stratified by sex and heart failure status. The dataset is a 20.7 KB XLSX file authored by José Alonso Solís-Lemus and last updated on June 2, 2026. It is licensed under CC-BY-4.0 and hosted on figshare.