Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
153,813 datasets
Annual averages from 2006 to 2015 present wage information for employees in Canada and its provinces. The dataset, a customization of Statistics Canada data, includes average hourly and weekly wage rates broken down by immigrant status, industry, type of work (full- and part-time), and sex. It is published by the Government of Alberta.
Annual average wage data from 2006 to 2015 for Canada and provinces, customized from Statistics Canada. It presents average hourly and weekly wage rates for employees categorized by type of work, immigrant status, industry, and sex. The dataset is provided by the Government of Alberta.
3,022,656 data points form the January 2002 edition of the first integrated onshore/offshore magnetic anomaly grid for the complete Australian margin. The grid covers 8°S to 52°S and 106°E to 172°E with a cell size of 0.01 degree, approximately 1 km, and values are in nanoTesla (nT). It was created by combining levelled and unlevelled marine data sectors with an earlier onshore grid, though mismatches exist at some onshore/offshore joins.
A 2022 inventory of information assets published by ICFES, the Colombian Institute for Educational Evaluation. The registry lists available records, their formats, and access points for public use. It is hosted on the Colombian open data portal, datos.gov.co.
United Kingdom public procurement notices published on the Contracts Finder portal for June 2026. The data is structured according to the Open Contracting Data Standard (OCDS) and provided in a flattened CSV format, with daily files containing all releases. This format likely includes details on tender opportunities, awards, and contract implementations.
Marco Vinicio Alban-Paccha published an eligibility matrix for healthy control cohorts in remote mobile app and wearable sensor sub-studies on figshare. The matrix details common inclusion criteria and sub-study-specific exclusions. The dataset is a 5.5 KB Excel file last updated on May 21, 2026.
Graham Elliott's replication dataset for the paper 'Combining Forecasts - On Why Averaging Beats Optimal Linear Weights'. The 168.3 MB collection includes code and data files supporting the analysis of the forecast combination puzzle. The dataset was last updated on 2026-04 27 and is shared under a CC-BY-4.0 license.
Individual monthly hillslope cover erosion rates, measured in tonnes per hectare per month, are provided for the state of New South Wales for the year 2017. The dataset is published by the NSW Department of Climate Change, Energy, the Environment and Water under a CC-BY-4.0 license. It was last updated on 2026-05-18.
A 2006 spectral library record for aquatic substrates from the Coringa Herald region, hosted in Australia's National Spectral Database. The data is part of a CSIRO research project on remote sensing for marine protected areas, with related publications from 2010 and 2013. Access is managed by Geoscience Australia.
5,625 finite element simulation records for a microstrip patch antenna sensor designed for non-invasive sweat electrolyte monitoring at 2.4 GHz. The dataset, created by Rakib, Sakhawat Hossen and hosted on Harvard Dataverse, was generated using COMSOL Multiphysics 6.3 and includes simulations across three substrate materials and five NaCl concentration levels. It was last updated on June 1, 2026.
REGDOC-2.6.3 sets out requirements and guidance from the Canadian Nuclear Safety Commission for managing the aging of structures, systems, and components in power reactor facilities. The document provides a framework for licensees to establish and implement aging management programs to ensure safety functions remain available throughout a facility's service life. It addresses both physical aging and obsolescence that could affect safe operation.
A 1999 Landsat ETM+ mosaic provides a land cover classification for Uruara, Para, Brazil, distinguishing forested from deforested areas. The single GeoTIFF image is designed to be overlaid with a cadastral property map of the same region from circa 1975. This dataset supports analysis of historical deforestation patterns in the Amazon Basin.
34.9 MB of PDF and DOCX files containing materials for introductory workshops for the Brazilian Informatics Olympiad (OBI) and the SBC Programming Marathon. The repository includes workshop plans, questionnaires, mock exams, and presentation slides used during the activities. It was authored by Bianca Araújo and last updated on May 28, 2026.
MASTER instrument data from the Western Diversity Time Series airborne campaign includes Level 1B and Level 2 products. The dataset contains calibrated radiance imagery across 50 spectral bands and derived land surface temperature and emissivity from nine NASA ER-2 flights over California and Nevada in spring 2021. It is provided by ORNL_CLOUD and serves as a benchmark for ecosystem state and change assessment.
Fifty spectral bands of calibrated radiance data were captured by the MODIS/ASTER Airborne Simulator (MASTER) instrument during five NASA ER-2 flights over California and Nevada from September 2-9, 2022. This dataset provides Level 1B georeferenced imagery at ~50-meter resolution and derived Level 2 products including land surface temperature and emissivity. Managed by ORNL_CLOUD, the data supports the Western Diversity Time Series campaign's goal of benchmarking ecosystems and monitoring natural disasters.
MASTER instrument Level 1B and Level 2 data products from the Western Diversity Time Series airborne campaign. The dataset contains georeferenced multispectral imagery across 50 spectral bands and derived land surface temperature and emissivity products, collected during four NASA ER-2 flights over California and Nevada in June 2024. It is managed by the ORNL_CLOUD organization.
Nine NASA ER-2 flights collected 50-band multispectral imagery over California from September 17 to October 15, 2020. The dataset provides Level 1B calibrated radiance and Level 2 derived products like land surface temperature and emissivity. Data from the MODIS/ASTER Airborne Simulator supports the Western Diversity Time Series program for ecosystem and disaster monitoring.
Gene expression panels for sex-specific bladder cancer biomarker discovery derived from RNA-seq data using machine learning. The dataset was created by Joseph R. Pizzi and published on figshare in April 2026. It contains results from applying four feature selection methods to identify robust gene signatures.
2021 Monthly Hillslope Cover Erosion provides monthly hillslope cover erosion rates in tonnes per hectare per month across New South Wales for the year 2021. The dataset is provided by the NSW Department of Climate Change, Energy, the Environment and Water and was last updated on 2026-05-18. Data is available in PDF and GEOTIFF file formats under a CC-BY-4.0 license.
Demo data for the development of SATINN, a neural network-based approach to analyzing mouse seminiferous images. The 1.1 GB archive contains sample test images and pre-trained neural networks for initial use. Author Ran Yang uploaded it to figshare on 2026-05-31 under a CC-BY-4.0 license.