General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
165,819 datasets
S-MODE Shipboard Radiometer Measurements Version 1 contains air-sea interaction data collected during the Sub-Mesoscale Ocean Dynamics Experiment pilot campaign. Air-Sea Interaction METeorology sensors on the R/V Oceanus recorded shortwave and longwave radiation fluxes approximately 300 km offshore of San Francisco over two weeks in October 2021. These measurements support the S-MODE mission to understand how small-scale ocean dynamics influence vertical exchanges of physical and biological variables.
Northern Saskatchewan and Manitoba data from the BOREAS project include ceilometer measurements of cloud fraction, cloud height, and surface-based lifting condensation level. The National Aeronautics and Space Administration collected this information at the NSA-OJP site in 1994 and at both NSA-OJP and SSA-OBS sites in 1996. Data formats include HTML, PDF, PNG, BIN, ISO, ZIP, and TEXT files.
A database of registered and/or renewed SMEs from the last decade, provided by the Chamber of Commerce of Honda, Guaduas and North Tolima in response to a citizen request. The dataset includes 24 columns such as RAZON SOCIAL, IDENTIFICACION, ACTIVIDAD, and FEC-MATRICULA. It was last updated on the platform on 2026-05-18.
Marina Pavlova published a tabular dataset on figshare in June 2026. The dataset likely contains node degree metrics for a protein-protein interaction network derived from a full set of differentially expressed genes. The file is 20.8 KB in size.
A 6.7 KB CSV file contains a community similarity matrix calculated using the Sørensen similarity index. The dataset, authored by Seokmin Kim and last updated on 2026-05-27, focuses on species with a floating viability time of less than 7 days. It is shared under a CC-BY-4.0 license on the figshare platform.
A community similarity matrix calculated using the Sørensen similarity index. It focuses on species whose median 95th-percentile floating viability time exceeds 7 days. The dataset is 6.4 KB in size, authored by Seokmin Kim, and was last updated on May 27, 2026.
Jeremy Oguamalam's research dataset from 2026 introduces a robust method for functional principal component analysis (FPCA) on relative data. The 75.7 MB collection includes files for simulations and real-world applications validating the proposed RRPCA method. Data is shared under a CC-BY-4.0 license on the figshare platform.
Canada's supply and disposition of milk products measured in tonnes. The data is published monthly by Statistics Canada. The dataset was last updated on 2026-05-28.
New York City data on fees assessed against properties by the Housing Preservation and Development department pursuant to the Housing Maintenance Code. The dataset includes columns such as FeeAmount, FeeIssuedDate, FeeType, and geographic identifiers like BBL, BIN, Longitude, and Latitude. It was last updated on 2026-05-28 and is hosted on the data.cityofnewyork.us platform via Socrata.
Use Permit Applications from the City of Orlando for activities like sidewalk cafés and right-of-way work. The dataset includes 30 columns tracking project details, reviews, and status from application to expiration. It is hosted on the city's open data portal and was last updated on May 28, 2026.
APEX float in-situ measurements of subsurface ocean properties were collected during the Sub-Mesoscale Ocean Dynamics Experiment (S-MODE) field campaign. The data were gathered approximately 300 km offshore of San Francisco in Spring 2023 and support the campaign's aim of understanding how small-scale ocean dynamics influence vertical exchange of physical and biological variables. The US Naval Oceanographic Office (NAVO) floats measured temperature and salinity, with data available in netCDF format.
Three years of continuous methane and carbon dioxide measurements collected at the 'Arcturus' station in the Bowen Basin, Australia, by Geoscience Australia and CSIRO Marine & Atmospheric Research. The dataset supports a simulation study analyzing the sensitivity of atmospheric techniques for detecting fugitive emissions from a simulated coal seam gas field against a baseline. Results, including an indicative minimum detectable emission rate, were presented at the American Geophysical Union meeting in December 2013.
Current revenue collection by Colombia's territorial entities and regional funds, tracked against Biennial Cash Plan projections. The dataset includes columns for month, department, fund, and accumulated amounts, sourced from the national open data portal. It was last updated on May 26, 2026.
A list of public servants with active employment in the Colombian Public Employment Information and Management System (SIGEP) who hold positions of trust, budgetary management, and free appointment and removal. The dataset includes 27 columns such as SEXO, NOMBRE_INSTITUCION, MESES_EXPERIENCIA_PUBLICO, and NIVEL_JERARQUICO_EMPLEO. It was published by datos.gov.co and last updated on 2026-05-18.
SoE2020: Local heritage places and areas is a dataset from the Queensland government's Department of Environment, Tourism, Science and Innovation. It likely contains records of heritage sites identified and protected by local governments through planning schemes. The dataset is available under a CC-BY-4.0 license and was last updated on 2026-05-27.
Queensland, Australia, has 13 places listed on the National Heritage List as of 2018. The dataset, published by the Queensland Department of Environment, Tourism, Science and Innovation, includes Quinkan Country on Cape York Peninsula, which was added in 2018. It was last updated on 2026-05-27.
Local governments in Queensland identify and protect local heritage places and areas through planning schemes. The dataset is provided by the Queensland Department of Environment, Tourism, Science and Innovation under a CC-BY-4.0 license. It was last updated on 2026-05-27.
Twelve Queensland places are listed on Australia's National Heritage List. The dataset is provided by the Queensland Department of Environment, Tourism, Science and Innovation and was last updated in May 2026. It likely contains information about these heritage-listed sites.
NASA's VIIRS/JPSS2 VJ201_NRT product delivers unpacked, raw sensor counts from the Visible Infrared Imaging Radiometer Suite aboard the JPSS-2 satellite. This Level 1A data includes science, calibration, and engineering data, along with extracted spacecraft ephemeris, attitude, and telemetry. Its near-real-time 6-minute swath format provides foundational radiance measurements for downstream environmental monitoring.
Transactions processed by Colombian consulates worldwide, with details on type, location, and applicant profiles. The dataset includes columns for civil status, age group, month, office, transaction type, year, academic level, number of transactions processed, office code, transaction code, gender, and transaction description. It was published by www.datos.gov.co and last updated on 2026-05-18.