General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
153,224 datasets
Marina Pavlova published a tabular dataset on figshare in June 2026. The dataset likely contains node degree metrics for a protein-protein interaction network derived from a full set of differentially expressed genes. The file is 20.8 KB in size.
A 6.7 KB CSV file contains a community similarity matrix calculated using the Sørensen similarity index. The dataset, authored by Seokmin Kim and last updated on 2026-05-27, focuses on species with a floating viability time of less than 7 days. It is shared under a CC-BY-4.0 license on the figshare platform.
A community similarity matrix calculated using the Sørensen similarity index. It covers species whose floating viability time (median 95th percentile) exceeds 7 days. The dataset is 6.4 KB in size, authored by Seokmin Kim, and was last updated on May 27, 2026.
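For readers unfamiliar with the Sørensen index named in the two entries above, the following is a minimal sketch of how such a community similarity matrix could be computed from presence/absence data. It uses the standard definition, 2|A ∩ B| / (|A| + |B|). The function names, the site labels, and the species lists are hypothetical and are not taken from the datasets themselves; the authors' actual procedure is not described in the listing.

```python
def sorensen_similarity(a, b):
    """Sørensen index for two collections of species names (presence/absence)."""
    a, b = set(a), set(b)
    if not a and not b:
        # Convention assumed here: two empty communities are treated as identical.
        return 1.0
    return 2 * len(a & b) / (len(a) + len(b))


def similarity_matrix(communities):
    """Pairwise Sørensen similarities for a dict mapping site name to species."""
    names = list(communities)
    return {
        row: {col: sorensen_similarity(communities[row], communities[col])
              for col in names}
        for row in names
    }


# Hypothetical example data, for illustration only.
example = {
    "site_1": {"sp_a", "sp_b", "sp_c"},
    "site_2": {"sp_b", "sp_c", "sp_d"},
    "site_3": {"sp_e"},
}
matrix = similarity_matrix(example)
```

In this sketch the matrix is symmetric with 1.0 on the diagonal, and sites that share no species score 0.0.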
Jeremy Oguamalam's research dataset from 2026 introduces a robust method for functional principal component analysis (FPCA) on relative data. The 75.7 MB collection includes files for simulations and real-world applications validating the proposed RRPCA method. Data is shared under a CC-BY-4.0 license on the figshare platform.
Canada's supply and disposition of milk products measured in tonnes. The data is published monthly by Statistics Canada. The dataset was last updated on 2026-05-28.
New York City data on fees assessed against properties by the Housing Preservation and Development department pursuant to the Housing Maintenance Code. The dataset includes columns such as FeeAmount, FeeIssuedDate, FeeType, and geographic identifiers like BBL, BIN, Longitude, and Latitude. It was last updated on 2026-05-28 and is hosted on the data.cityofnewyork.us platform via Socrata.
Use Permit Applications from the City of Orlando for activities like sidewalk cafés and right-of-way work. The dataset includes 30 columns tracking project details, reviews, and status from application to expiration. It is hosted on the city's open data portal and was last updated on May 28, 2026.
APEX float in-situ measurements of subsurface ocean properties were collected during the Sub-Mesoscale Ocean Dynamics Experiment (S-MODE) field campaign. The data, gathered approximately 300 km offshore of San Francisco in Spring 2023, aims to understand how short-scale ocean dynamics influence vertical exchange of physical and biological variables. The US Naval Oceanographic Office (NAVO) floats measured temperature and salinity, with data available in netCDF format.
Three years of continuous methane and carbon dioxide measurements collected at the 'Arcturus' station in the Bowen Basin, Australia, by Geoscience Australia and CSIRO Marine & Atmospheric Research. The dataset supports a simulation study analyzing the sensitivity of atmospheric techniques for detecting fugitive emissions from a simulated coal seam gas field against a baseline. Results, including an indicative minimum detectable emission rate, were presented at the American Geophysical Union meeting in December 2013.
Colombia's territorial entities and regional funds are tracked for their current revenue collection against the Biennial Cash Plan projections. The dataset includes columns for month, department, fund, and accumulated amounts, sourced from the national open data portal. It was last updated on May 26, 2026.
A list of public servants with active employment in the Colombian Public Employment Information and Management System (SIGEP) who hold positions of trust, budgetary management, and free appointment and removal. The dataset includes 27 columns such as SEXO, NOMBRE_INSTITUCION, MESES_EXPERIENCIA_PUBLICO, and NIVEL_JERARQUICO_EMPLEO. It was published by datos.gov.co and last updated on 2026-05-18.
SoE2020: Local heritage places and areas is a dataset from the Queensland government's Department of Environment, Tourism, Science and Innovation. It likely contains records of heritage sites identified and protected by local governments through planning schemes. The dataset is available under a CC-BY-4.0 license and was last updated on 2026-05-27.
Queensland, Australia, has 13 places listed on the National Heritage List as of 2018. The dataset, published by the Queensland Department of Environment, Tourism, Science and Innovation, includes Quinkan Country on Cape York Peninsula, which was added in 2018. It was last updated on 2026-05-27.
Local governments in Queensland identify and protect local heritage places and areas through planning schemes. The dataset is provided by the Queensland Department of Environment, Tourism, Science and Innovation under a CC-BY-4.0 license. It was last updated on 2026-05-27.
Twelve Queensland places are listed on Australia's National Heritage List. The dataset is provided by the Queensland Department of Environment, Tourism, Science and Innovation and was last updated in May 2026. It likely contains information about these heritage-listed sites.
NASA's VIIRS/JPSS2 VJ201_NRT product delivers unpacked, raw sensor counts from the Visible Infrared Imaging Radiometer Suite aboard the JPSS-2 satellite. This Level 1A data includes science, calibration, and engineering data, along with extracted spacecraft ephemeris, attitude, and telemetry. Its near-real-time 6-minute swath format provides foundational radiance measurements for downstream environmental monitoring.
Transactions processed by Colombian consulates worldwide, with details on type, location, and applicant profiles. The dataset includes columns for civil status, age group, month, office, transaction type, year, academic level, number of transactions processed, office code, transaction code, gender, and transaction description. It was published by www.datos.gov.co and last updated on 2026-05-18.
Local governments in Queensland identify and protect local heritage places and areas through planning schemes. The dataset was published by the Queensland Department of Environment, Tourism, Science and Innovation under a CC-BY-4.0 license. It was last updated on 2026-05-27.
Gondwana Rainforests of Australia meets three World Heritage natural criteria. The dataset is provided by the Queensland Department of Environment, Tourism, Science and Innovation under a CC-BY-4.0 license. Its last metadata update was recorded on 2026-05-27.
2001 forest cover data for all of South America derived from MODIS satellite imagery. The dataset contains a single GeoTIFF file where pixels are classified as forest, non-forest, or water based on a 40% canopy cover threshold. It was produced by the MODIS science team using the TERRA satellite platform.