General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
166,360 datasets
A legacy workshop report from the Intergovernmental Oceanographic Commission (IOC) focusing on benthic microbes and coral reefs. The report was produced from a workshop held in Townsville, Australia, in August 1988 and is hosted by the Australian Ocean Data Network. The dataset consists of HTML and PDF files, but the abstract, sample data, and column details are unavailable.
A historical report from the Australian Ocean Data Network details geological work conducted during the relief voyage of the M.S. Thala Dan between December 1961 and March 1962. The dataset is published on data_gov_au and was last updated in 2026. The original abstract is unavailable, and the data is provided in HTML and PDF formats.
Vema cruise 33, leg 2, documented oceanographic observations in the southeast Indian Ocean from 21 December 1975 to 17 January 1976. The dataset is an observer's report published by the Australian Ocean Data Network and last updated on 2026-06-27. Available file formats include HTML and PDF.
A directory of EPS, IPS, and health centers in Yopal Municipality, Colombia, updated as of May 2026. The dataset includes basic contact information and geospatial coordinates for each entity. It is published by the Colombian open data platform www.datos.gov.co.
Potential water supply volumes in megalitres per day from various resource options under different future scenarios. The data, provided by the UK Environment Agency, models water availability at national, regional, and company scales for the 2050s. It includes projections for 'Do Nothing', 'Low', 'Central', and 'High' national framework scenarios.
With 26 columns, this dataset appears to be a detailed record of municipal code enforcement actions, including case status, inspections, and hearings. It includes geospatial coordinates (XPos, YPos) and property identifiers (GeoPIN) that link violations to specific locations. It is published by the City of New Orleans and appears on multiple government data platforms.
Geoscience Australia Data presents seabed bathymetry compilations for the Australian Antarctic margin, derived from multibeam, singlebeam, and ETOPO2 satellite data. The flythrough reveals complex seabed environments, particularly off the Davis coastline, and includes images of seabed communities for the George V margin and Davis coastline. This data was last updated on 2026-05-14.
A curated dataset for binary text classification focused on identifying Moroccan Darija (Arabic script or Arabizi) versus other dialects, Modern Standard Arabic, or languages like English. The dataset was built from multiple public sources, cleaned, and taxonomized by author atlasia. It was last updated on 2026-06-07.
Sudip Saha published a dataset comparing recent state-of-the-art adversarially robust Network Intrusion Detection System (NIDS) models. The dataset is a 5.5 KB Excel file, last updated on June 1, 2026. It is shared under a CC-BY-4.0 license on the figshare platform.
A 2024–2026 summary of research on adversarially robust Network Intrusion Detection Systems (NIDS) and defense approaches. The dataset is a 9.5 KB Excel file authored by Sudip Saha and shared under a CC-BY-4.0 license on figshare. It was last updated on June 1, 2026.
As of April 2026, this dataset lists all cannabinoid hemp licenses issued by the New York State Office of Cannabis Management. It provides details on licensed entities, their locations, license status, and validity periods. The data is available in multiple formats and is hosted on official state data platforms.
NASA's dataset provides 30-meter resolution maps of Cajander larch aboveground biomass circa 2007 and perimeters for 116 forest fires from 1966-2007. The data covers approximately 100,000 square kilometers of the Kolyma River Basin in northeastern Siberia, Russia. It combines biomass estimates with a multi-decadal fire history for a key boreal forest region.
A document titled 'Notes on interfacing electronic equipment with special reference to the 1977 marine data acquisition system'. It is a legacy product from the Australian Ocean Data Network with no abstract available. The dataset was last updated on 2026-06-27 17:14:36.551615.
Ocean Drilling Program Leg 182 data focuses on Cenozoic cool-water carbonates from the Great Australian Bight. The dataset is published on data_gov_au by the Australian Ocean Data Network and was last updated in June 2026. It likely contains safety and pollution prevention documentation from a May 1997 panel.
BMR proposals for cooperation with the Woods Hole Institution concerning the east Indian Ocean from 1975. The dataset is published by the Australian Ocean Data Network on data_gov_au. It is a legacy product for which no abstract is available.
Legacy product from the Australian Ocean Data Network with no abstract available. The report likely contains findings from a joint Australian-Japanese marine geological expedition conducted in the Arafura Sea. It was published on the data_gov_au platform and last updated in June 2026.
Lower Cretaceous microfossils recovered from a bore on the Plenty River. Their presence demonstrates a westerly extension of the marine Lower Cretaceous below the sand cover of the Simpson Desert. The dataset is provided by the Australian Ocean Data Network.
SIVIGILA 2018 contains records from Colombia's National Public Health Surveillance System, designed to provide systematic and timely information on events affecting population health. The dataset includes columns such as ANO (year), SEMANA (week), COD_EVE (event code), and conteo_casos (case count). It is hosted by datos.gov.co and was last updated on 2026-05-18.
Approximately 300 km offshore of San Francisco, the Sub-Mesoscale Ocean Dynamics Experiment (S-MODE) deployed three wave gliders during a pilot campaign in October 2021 and intensive periods in Fall 2022 and Spring 2023. The dataset contains high-frequency sensor observations, including sonic anemometers, radiometers, CTD profilers, ADCPs, and a 20Hz IMU, to study sub-mesoscale influences on vertical exchange. Data are provided in netCDF format.
A presentation service provides coordinate networks for the territory of the Federal Republic of Germany. The coordinate grids are displayed as a regular grid with a scale-dependent grid width. The data is provided by the Bundesamt für Kartographie und Geodäsie under the Data licence Germany – attribution – Version 2.0.