Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
165,609 datasets
Property tax records for Guadalajara de Buga, Colombia, spanning over six decades from 1960. The dataset includes detailed information on land and property taxes paid by owners. It is hosted by datos.gov.co and was last updated in May 2026.
A 5.5 KB Excel dataset presents results from a unified framework for evaluating machine learning-based Intrusion Detection Systems (IDS). The framework harmonizes features from the NSL-KDD and CICIDS2017 datasets and benchmarks models including Random Forest, which achieved 98.0% accuracy and 97.0% F1-score. Authored by Shailendra Mishra and last updated on April 20, 2026, this work focuses on reproducibility and statistical validation in cybersecurity research.
Shailendra Mishra's evaluation metrics reporting summary, published on figshare in April 2026. The 5.5 KB XLS file contains results from a unified framework for evaluating Intrusion Detection Systems (IDS). The framework harmonized features from the NSL-KDD and CICIDS2017 datasets and benchmarked supervised, unsupervised, deep learning, and ensemble models.
5.5 KB of statistical test results from a framework evaluating machine learning models for network intrusion detection. The dataset, authored by Shailendra Mishra and last updated in April 2026, contains results from Wilcoxon signed-rank, McNemar’s, and DeLong tests applied to models like Random Forest on harmonized NSL-KDD and CICIDS2017 datasets.
Shailendra Mishra's framework harmonizes features from the NSL-KDD and CICIDS2017 network intrusion datasets for evaluating machine learning models. The dataset, last updated in April 2026, is a 5.5 KB Excel file containing the harmonized data used in the study. Experimental results from the framework demonstrated a Random Forest model achieving 98.0% accuracy and 97.0% F1-score on this data.
A 5.5 KB dataset from figshare, last updated on 2026-04-20, containing results from an ablation study on machine learning models for intrusion detection. The work by Shailendra Mishra proposes a unified framework, harmonizing the NSL-KDD and CICIDS2017 datasets and benchmarking models including Random Forest, which achieved 98.0% accuracy and 97.0% F1-score.
A 5.5 KB Excel file containing harmonized features from two network intrusion datasets, NSL-KDD and CICIDS2017, for evaluating machine learning models. The dataset was created by Shailendra Mishra and last updated on April 20, 2026. It supports a framework for reproducible and statistically validated benchmarking of Intrusion Detection Systems.
Cross-validation results from a framework evaluating machine learning models for network intrusion detection. The dataset contains performance metrics from models like Random Forest, which achieved 98.0% accuracy and 97.0% F1-score on harmonized data. The work by Shailendra Mishra was last updated in April 2026.
A 5.5 KB Excel dataset created by Shailendra Mishra and last updated on April 20, 2026. It contains harmonized features from the NSL-KDD and CICIDS2017 network intrusion datasets, processed through a unified framework for evaluating machine learning-based Intrusion Detection Systems (IDS). The work includes results from benchmarking supervised, unsupervised, deep learning, and ensemble models.
Shailendra Mishra's framework evaluates Intrusion Detection Systems (IDS) using harmonized features from the NSL-KDD and CICIDS2017 datasets. The work benchmarks supervised, unsupervised, deep learning, and ensemble models, reporting a Random Forest model achieving 98.0% accuracy and 97.0% F1-score on the harmonized data. The dataset, last updated in April 2026, is a 5.5 KB Excel file detailing experimental results and trade-offs.
The Lord Howe Rise Project is a collaborative research project between Geoscience Australia and the Japan Agency for Marine-Earth Science and Technology (JAMSTEC). It aims to better understand the geology, tectonics and paleoenvironment of the central Lord Howe Rise, which is part of northern Zealandia. The dataset is represented by a project flyer from 2018, hosted by the Australian Ocean Data Network.
Registro de Activos de Información is an inventory of public information assets managed by Colombian government entities, including the Ministry of Foreign Affairs and the Departmental Institute of Fine Arts. The dataset catalogs information assets with details on their description, language, physical medium, designated owner, and conservation method. It likely contains metadata for tracking and managing official documents and records across public administration.
The Municipal Performance Measurement (MDM) dataset aims to measure and compare municipal management and development outcomes across the Department of Magdalena, Colombia. It includes scores for education, health, security, services, and governance, adjusted for initial municipal capacities. The data is hosted by datos.gov.co and was last updated on 2026-05-18.
Geospatial data identifies Food Production Protection Zones in the Sabana Centro province of Cundinamarca, Colombia. The dataset covers 11 municipalities prioritized by the Ministry of Agriculture and Rural Development. It includes columns for municipality, geometry, department, area in hectares, and administrative codes.
A compilation video showing bathymetric data related to MH370 and Geoscience Australia staff at work. The video is provided by the Australian Ocean Data Network and was last updated on June 16, 2026. It is intended primarily for use by media organizations.
Hugging Face hosts the Image Moderation Specialist dataset, created by cloverxion and last updated in June 2026. It exclusively contains highly sensitive, explicit, violent, and weaponized adult imagery. This collection is engineered for AI safety engineering and institutional academic research.
CIPHER encoding matrix maps genes to 18-bit codes via probe counts. The dataset is 920.6 KB in size and was authored by Zachary Hemminger. It was last updated on June 4, 2026.
Gene Ontology enrichment terms for genes assigned to each bit of a final CIPHER encoding matrix. The dataset is a 2.8 MB CSV file authored by Zachary Hemminger and last updated on June 4, 2026. It is shared under a CC-BY-4.0 license on the figshare platform.
SWOT Level 1B data provides interferograms from nine distinct Doppler beams, corrected for on-board processing phase biases. Each netCDF-4 file contains gridded, spatially averaged measurements and geometry data for a full satellite swath per half-orbit. This dataset is produced by NASA's On Board Processor and corrected on the ground for use in advanced hydrological and oceanographic analysis.
Hourly surface temperature data recorded by Onset HOBO Pro v2 sensors buried a few centimeters below ground. The data originates from the Tasiapik Valley near Umiujaq in Nunavik, Quebec, Canada, with temperatures provided in degrees Celsius. The dataset was authored by Richard Fortier and is hosted on the Borealis Harvested Dataverse platform.