Loading...
Loading...
Offline RL trajectories, game data, robot demonstrations, RLHF, multi-agent interaction
10,031 datasets
QIIME 2 is a free, open-source microbiome multi-omics bioinformatics platform. The dataset, provided by the Caporaso Lab, is released under a BSD 3-Clause License and is hosted on AWS Open Data. The last update date is unknown.
This dataset supports a study on fiscal policy determination under council-manager and mayor-council government forms in US cities. The analysis includes cross-sectional and panel analyses of changes in government form to test predictions about public spending levels.
Approximately 17,000 successful robot teleoperation episodes filtered for high-quality camera extrinsics. The data is a subset of DROID-COMMUNITY, ported from its raw format to the LeRobotDataset v3.0 structure by the author jnogga. It was last updated on February 16, 2026.
State of California data provides distributed collection figures for child support cases, categorized by county and public assistance type. The dataset sorts cases into Current or Former Public Assistance, or Never Assisted categories, covering cases where children received aid like AFDC, TANF, Foster Care, or Medicaid.
African Next Voices: Pilot Data Collection in Kenya is part of a larger initiative to support African language speech technology. The project is funded by the Gates Foundation and led by the KenCorpus Consortium. The dataset is a work in progress, with updates continuing through September 2025.
This dataset supports a study examining the path-dependence of knowledge-intensive industry location in Russia following the end of the Soviet planned economy in 1991. It analyzes the relationship between the 1991 geographic distribution of R&D personnel and the subsequent development of market-oriented knowledge-intensive business services like engineering and IT. The dataset was authored by Denis Ivanov.
397 US voters were surveyed in 2017 regarding their party identity and stances on 20 policy issues. Created by Philip Moniz, this dataset compares these self-reported responses against commercial vendor predictions to assess the accuracy of political microtargeting.
Zooplankton biomass data, including displacement and settled volume measurements, were collected aboard the R/V Eltanin during the U.S. Antarctic Research Program. Sampling occurred from April 5, 1963 to March 12, 1967 in support of Antarctic research. The dataset is archived by NOAA's National Centers for Environmental Information.
Risk_Level_Classification is a dataset for classifying transaction risk levels, updated by Jack Ward. The target variable 'anomaly' is treated as a nominal variable with three categories: low risk, moderate risk, and high risk. The dataset is licensed under CC-BY-4.0 and is hosted on OpenML.
DDOT maintains centerline data for all roads and alleys open to traffic in the District of Columbia. This geospatial dataset supports transportation infrastructure analysis and urban planning.
Cairo is an R graphics device using the cairographics library to create high-quality output. It supports vector formats like PDF and SVG, bitmap formats like PNG and JPEG, and display rendering for X11 and Win32 systems. The device is authored by Simon Urbanek and provides WYSIWYG copying across formats due to a unified back-end.
Replication Data for a think-aloud study on speech disfluencies as a measure of skill automaticity in psychotherapy. The dataset was authored by Dan Sacks and last updated on March 28, 2026. It is hosted on the Dataverse platform under the Social Sciences category.
A policy statement authored by George W. Bush outlining the foreign policy approach against international terrorism in the early period following the 9/11 attacks. The text presents the administration's stance that nations harboring or supporting terrorism would be considered hostile. The dataset is sourced from the paperswithcode platform and is licensed as closed.
Walter LaFeber's historical text analyzes U.S. expansion and industrialization from 1865 to 1913. The work likely contains narrative and thematic analysis of events like the Second Industrial Revolution and the empire of 1898. Its structure includes chapters, a bibliographic essay, and an index.
Lubna Z. Qureshi's work, 'Nixon, Kissinger, and Allende: U.S. Involvement in the 1973 Coup in Chile', is a structured historical text. The content is organized into chapters covering U.S.-Latin American relations, the 1970 election, and events from 1971 to 1973. The dataset appears to be a textual analysis of political events sourced from the paperswithcode platform.
A historical text collection details Japanese diplomatic missions and student travels to America and Europe from 1860 to 1873. The work by W. G. Beasley covers topics including trade, cultural borrowing, military reform, and the Iwakura Embassy. The dataset is sourced from the paperswithcode platform and is licensed as closed.
Satellite remote sensing data intended to support the North Atlantic study within the Marine Productivity programme. The dataset is provided by the SCIOPS organization via the NASA EarthData platform. Its specific temporal coverage and volume are not detailed in the available metadata.
No-show appointments data, likely containing records of scheduled appointments and their attendance outcomes. The dataset is hosted on Kaggle, but its specific source, size, and creation details are not provided in the available metadata. Columns and data specifics are unknown and require verification after download.
Files support the California's Groundwater Live website, a platform for viewing and interacting with groundwater information. The dataset includes materials in JPEG, HTML, ZIP, DOC, and PDF formats, managed by the State of California and last updated in March 2026.
ASAC project 2205 produced this dataset of satellite-tracked Adelie penguin movements in the Dumont D'Urville region, Antarctica. It records foraging locations determined from satellite fixes during the 1995-1996 summer season. The dataset was last updated in March 1996.