Loading...
Loading...
Offline RL trajectories, game data, robot demonstrations, RLHF, multi-agent interaction
10,022 datasets
The Zillow Transaction and Assessment Database (ZTRAX) contains over 400 million public records across more than 2,750 U.S. counties. It includes deed transfers, mortgages, foreclosures, and property characteristics for approximately 150 million parcels. The dataset is sourced from a third-party provider and supplemented by Zillow's County Direct program, with temporal coverage varying by county but generally sufficient from April 1996 onward.
Bathythermograph (XBT) data from US Navy Ships of Opportunity provides temperature profile measurements in the Northeast Atlantic Ocean. Data collection occurred from May 30 to June 11, 1980, supported by the Gulf of Mexico NOAA/NMFS Ships of Opportunity project.
A video dataset titled 'neurogolf-convseries-part3-supportvideo' published on Kaggle. The dataset's specific content, size, and creation details are not provided in the available metadata.
129 teachers and 2,069 Primary Four students participated in a study on Chinese reading in Hong Kong. The dataset contains teacher self-efficacy scores, student evaluations of instructional quality, and student attainment test results. It supports mixed-method analysis of the relationship between teacher factors and student reading achievement.
This dataset supports a study on the electoral influence of progressive Catholic bishops in Brazil following the appointment of Pope John Paul II. It leverages as-if random variation in municipalities' exposure to these bishops to analyze their impact on support for the left-wing Workers' Party (PT).
Electrophysiological recordings acquired using Neuropixels probes in different mice and labs, targeting the same brain locations including posterior parietal cortex, hippocampus, and thalamus. The dataset is hosted on AWS and published by the International Brain Laboratory under a CC-BY-4.0 license. The specific number of recordings, rows, and last update date are unknown.
A collection of skills for mechanistic interpretability analysis of large language models, including refusal geometry extraction and boundary surface mapping. The dataset is authored by bedderautomation and was last updated on March 11, 2026. It is designed for use with Claude Code, OpenAI Codex, and Gemini CLI, supporting the agentskills.io standard.
A dataset of Ethereum wallet transactions, published on the Hugging Face platform by Arhenniuss. The dataset was last updated on April 22, 2026. The specific volume, columns, and time range of the transaction records are not detailed in the available metadata.
Brookhaven National Laboratory collected surface temperature, salinity, and pCO2 data via bottle casts from the METEOR vessel. Measurements were taken in the North Atlantic Ocean between September 3 and September 22, 1991. The data were submitted as part of the World Ocean Circulation Experiment (WOCE) project.
Zooplankton data were collected via net casts from the R/V Alpha Helix in the Gulf of Alaska. The dataset contains measurements from 13 October 1997 to 9 May 1999. It was collected and submitted by the University of Alaska Institute of Marine Sciences as part of the Global Ocean Ecosystems Dynamics (GLOBEC) project.
Chlorophyll-a concentration profiles collected in the Atlantic Ocean and adjoining seas from 03/02/1961 to 10/21/1992. Data were gathered by multiple institutions as part of the North Atlantic Chlorophyll Profile Data Set, with support from the European Space Agency and the Canadian Department of Fisheries and Oceans.
Moltbook Factcheck Conspiracy Grok contains multi-agent social simulation data from a Reddit-like platform where AI agents autonomously post, comment, and vote. It captures how Grok 4.1 Fast agents respond to conspiracy content seeded into their feed over 1-hour experimental runs with a 60-second action cycle. The dataset was created by Ayushnangia and last updated in February 2026.
Synthetic IT Support Tickets is a dataset from Kaggle designed for experiments with large language models, knowledge graphs, and retrieval-augmented generation. The description indicates it contains artificially generated IT support ticket data. The dataset's author, organization, size, and license are unknown.
Kaggle hosts this dataset titled 'neurogolf-convseries-part2-supportvideo'. The dataset likely contains video data related to golf, possibly supporting a neuroscience or sports analysis series. The author, organization, and specific details are unknown.
Management information from the Social Security Administration's Comprehensive Work Opportunities Support System (CWOSS). The data captures workload case management, contract, and payment information to support the Ticket to Work program. The dataset was last updated on March 10, 2026.
A high-resolution vector shoreline dataset compiled from imagery of Southern Barataria Bay, Louisiana. The data is based on office interpretation of imagery and uses the NOAA-developed Coastal Cartographic Object Attribute Source Table (C-COAST) attribution scheme. It is a member of the NOAA Inport catalog item 39808.
Northern Barataria Bay, Louisiana, has a high-resolution shoreline vector dataset compiled from imagery. The data is attributed using NOAA's Coastal Cartographic Object Attribute Source Table (C-COAST) scheme and is suitable as a GIS layer. It was published by the National Oceanic and Atmospheric Administration.
A high-resolution historical shoreline for the vicinity of Pensacola, Florida, automated for use as a GIS data layer. The data were derived from shoreline maps produced by the NOAA National Ocean Service and its predecessor agencies, based on office interpretation of imagery and/or field surveys. The attribution follows the NGS-developed C-COAST scheme, influenced by the International Hydrographic Organization's S-57 standard.
A high-resolution vector shoreline dataset for the Port of Sacramento, California, compiled from imagery. The data, attributed using the NGS-developed C-COAST scheme, is suitable for GIS applications and is a member of the NOAA InPort catalog item 39808. It was last updated on 2026 03 13.
NOAA's National Geodetic Survey provides a high-resolution vector shoreline for the Sacramento River between Sacramento Bend and Garcia Bend, California. The data is compiled from imagery and structured using the C-COAST attribution scheme to align with international hydrographic standards. This resource is part of a larger NOAA shoreline mapping program, with metadata last updated in March 2026.