Offline RL trajectories, game data, robot demonstrations, RLHF, multi-agent interaction
10,001 datasets
A synthetic dataset designed for anti-money laundering and fraud detection research. The data is hosted on Kaggle, but the author, organization, and creation date are unknown. The specific number of records and features is not provided.
The State of California's Department of Water Resources provides geospatial and tabular data from its ongoing Basin Characterization Program. This collection integrates new and existing data, such as airborne electromagnetic (AEM) surveys, lithology logs, and geophysical logs, to create 3D models and maps of groundwater aquifers. It supports local and statewide groundwater management under California's Bulletin 118 framework.
London's metrics for monitoring the Mayor's equality, diversity and inclusion strategy, published in 2018 and updated with new objectives in 2022. The dataset brings together publicly available data into a series of measures, which are updated regularly and visualized in dashboards. It is produced by the Greater London Authority and was last updated in March 2026.
Vehicle Registration Transactions by Department of Licensing shows counts of transactions for vehicles authorized for public roads. The dataset is hosted by data.wa.gov and was last updated on March 19, 2026. It includes columns for vehicle make, model, fuel type, owner type, county, and transaction details.
Bilateral agreements between Canada and Estonia cover trade, commerce, and taxation. The collection includes a trade and commerce agreement and a taxation convention. It was compiled by Global Affairs Canada and is archived as of March 2026.
Four bilateral agreements between Canada and Hungary cover mutual legal assistance, taxation, investment protection, and air transport. Global Affairs Canada compiled these archived legal texts, which were last updated on the platform in March 2026. The documents are provided for research and recordkeeping purposes.
Amazon provides a dataset of 548,000 anonymized seller support contacts across 118 intents, drawn from 70,000 sellers sampled over recent years. The data includes de-identified seller IDs, perturbed inter-arrival times between contacts, and contact intent codes. It is released under the CDLA-Permissive-1.0 license.
A catalog of 1,082 gamma-ray bursts detected by the Gamma Ray Burst Monitor on the BeppoSAX satellite, with 40-700 keV fluences from 1.3e-7 to 4.5e-4 erg/cm² and peak fluxes from 3.7e-8 to 7.0e-5 erg/cm²/s. NASA HEASARC created this catalog in January 2010 from the CDS catalog J/ApJS/180/192.
The Tropical Rain Forest Information Center (TRFIC) hosts satellite data for the SAFARI 2000 international science initiative. The collection includes Landsat 7 ETM+, Landsat 5 TM, and IKONOS imagery, which is geocoded and orthorectified. Access is restricted to registered SAFARI 2000 participants.
The Southwestern Pacific Regional OBIS Node contains marine biodiversity data primarily from New Zealand's Exclusive Economic Zone. Data likely includes species presence records from a series of research trawl surveys for fisheries management and decades of marine invertebrate research sampling. The dataset is a work in progress, with new data continually being added, and aims to eventually cover an area from Antarctica to Fiji.
NIWA's Marine Biodata Information System serves as the Southwestern Pacific Regional OBIS Node. The dataset contains results from research trawl surveys for fisheries management within the New Zealand EEZ, decades of marine invertebrate research sampling, and presence data for coralline algae. It is described as a work in progress, with new data continually being added.
NOAA Great Lakes Environmental Research Laboratory and the University of Michigan's Cooperative Institute for Great Lakes Research have collected physical, chemical, and biological water quality data in western Lake Erie since 2012. The dataset includes weekly discrete sampling and real-time buoy measurements from May to October each year. Parameters cover wind speed, water temperature, nutrients like phosphorus and nitrate, algal pigments, and toxins such as microcystin.
NIWA's Marine Biodata Information System warehouses results from New Zealand's fisheries trawl surveys and decades of marine invertebrate research. Data includes presence records for coralline algae along the coast. This ongoing project continually integrates new marine data for the Southwest Pacific region.
A sampling of Hierarchical Data Format version 4 (HDF4) data archived across eight NASA Earth Science Data Centers. The collection was created for a collaborative study by The HDF Group, GES-DISC, and NSIDC to assess file byte layouts and prototype layout maps. The resulting maps enable file reading without using the HDF API.
A Geographic Information System (GIS) database integrates seismological, geophysical, and geological data for the Middle East and North Africa region. The system was developed by SCIOPS to support Comprehensive Nuclear-Test-Ban Treaty (CTBT) monitoring and decision-making. It includes original results like crustal structure models and basement depth values.
Global atmospheric parameters from the GEOS-5 Forward Processing for Instrument Teams (FP-IT) assimilation product are co-located with the OMI/Aura VIS instrument's orbital swath. The dataset includes surface pressure, vertical temperature and wind profiles, tropopause pressure, boundary layer top pressure, and surface geopotential. It was produced by the Global Modeling and Assimilation Office (GMAO) for use by the OMI team and related research.
OMUFPITMET provides selected GEOS-5 Forward Processing for Instrument Teams assimilated parameters co-located with the OMI/Aura UV-2 satellite swath. The product includes surface pressure, vertical temperature profiles, wind profiles, and tropopause pressure, with vertical layers reduced from 72 to 47 to manage file size. It was generated by the Global Modeling and Assimilation Office for use by the OMI team and related research.
Monnaie_Transactionnelle_Données likely contains data for a simulated monetary market. The description suggests it models a market where supply and demand are expressed. The author, organization, and specific data volume are unknown.
An RStudio script that fits Ordinary Least Squares (OLS) regressions of predator and prey Equivalent Spherical Diameter (ESD). Jared Richards authored the script, which is intended for use alongside the S5_Dataset. It was last updated on April 13, 2026.
Biological validation results for the top 20 gene predictions from the CALDERA tool across 93 traits in the UK Biobank. The 13.5 KB Excel file includes a 'QTL' column indicating the strongest quantitative trait locus evidence for each gene, as referenced in the Weeks et al. study. Author Marijn Schipper published this dataset on figshare in March 2026.