Offline RL trajectories, game data, robot demonstrations, RLHF, multi-agent interaction
10,021 datasets
OpenSWE provides 45,320 executable Docker environments synthesized from over 12,800 software repositories. The framework was created by GAIR using a multi-agent synthesis pipeline deployed on a 64-node distributed cluster and was last updated in March 2026. All Dockerfiles, evaluation scripts, and infrastructure are open-sourced to ensure full reproducibility.
ATM Cash Withdrawal Transactions (USD) provides a daily history of cash demand and withdrawals for five automated teller machines, with values converted to US dollars. The dataset is hosted on Kaggle, but the author, organization, and specific time range are not specified. The description indicates it contains daily-level data, but the number of rows and specific columns are unknown.
A dataset of real foreign exchange (Forex) market trades. It likely contains transaction records and journal entries, which may detail trade parameters and outcomes. The dataset is published on Kaggle, but its author, size, and specific time range are unknown.
Supporting information for an Irrigation And Water Resource Management Project, authored by David Sampson Issaka. The dataset is a 23.0 KB DOCX file published on figshare under a CC BY 4.0 license. No row or column counts are available.
A dataset of between 100,000 and 1,000,000 parallel speech pairs for Hindi-to-English translation, released by mahendraphd in 2026. It pairs natural English speech sourced from TED talks with synthetic Hindi speech to support research in low-resource speech-to-speech translation (S2ST).
Experimental data from a study on electrokinetic remediation of severely sodic saline-alkali soil. The dataset supports a 2026 manuscript by Chenxin Du, investigating the effects of bulk density, water supply, and aluminum sulfate amendment. It is a small dataset, approximately 20.5 KB in size.
Greater London Authority data identifies specific areas with potential for increased residential and employment density. Each point represents a location where development at higher densities and more mixed use is encouraged, though below the level of designated Opportunity Areas. The data is sourced from the London Plan Consultation 2009 and was last updated in March 2026.
A 2014 document outlines the Southall Opportunity Area, designated in the 2011 London Plan, with potential for 9,000 new homes and 3,000 new jobs by 2041. The area is part of the Heathrow/Elizabeth Line West Growth Corridor, and the dataset is published by the Greater London Authority. The document has not been tested for compliance with accessibility standards.
The Olympic Legacy Opportunity Area (OA) is a designated planning zone in London with potential for 39,000 new homes and 65,000 new jobs by 2041. The OA was designated in the 2004 London Plan and is part of the Elizabeth Line East Growth Corridor. Documents related to this area were first published in 2012 and are provided by the Greater London Authority.
The Warwick-Edinburgh Mental Well-Being Scale (WEMWBS) is a 14-item positively worded survey instrument developed to measure population-level mental well-being. It was created by an expert panel at the University of Warwick, led by Ruth Tennant, drawing on academic literature, focus group research, and psychometric testing. The scale was validated on student and general population samples, demonstrating good content validity and internal consistency.
A 2024 briefing package prepared for the incoming Chair of the Transportation Safety Board of Canada. It contains information used to brief Yoan Marier, who was appointed on August 21, 2024. The document was published by the Transportation Safety Board of Canada and last updated in March 2026.
Active adult probation supervision cases in New York City are categorized into eight specific supervision types, including Intensive Engagement and Neighborhood Opportunity Network (NeON). The dataset is published by the City of New York and was last updated on March 8, 2026. It likely contains counts of active cases on the last day of a reporting period.
Rain Gages data from the City of Seattle primarily supports the permanent Combined Sewer Overflow monitoring network to determine rainfall events and calculate rainfall depth. The dataset also supports other DWW monitoring programs for sampling, modeling, and operations. It was last updated on March 8, 2026.
Monthly provider data for California's Medi-Cal Managed Care Plans, submitted via X12 274 Transaction files. The listing includes 24 Managed Care Plans as of January 2024, with provider details and address information.
The contract between the London Borough of Barnet and Capita plc for Customer and Support Group services runs to 2,084 pages. The document details a 10-year agreement projected to save over £165 million. It includes schedules updated to reflect the return of Finance and Strategic HR services to the council in April 2019.
A transaction dataset sourced from Kaggle. The dataset's specific content, scale, and origin are not detailed in the provided metadata. Columns and data characteristics require verification after download.
MoNuSAC-converted is a dataset hosted on Kaggle. The title suggests it is a converted version of the MoNuSAC dataset, which is a benchmark for nuclei segmentation and classification in histopathology images. The dataset's specific contents, size, and provenance details are not provided in the available metadata.
MoNuSAC is a dataset hosted on Kaggle. Its title suggests it contains medical images for nuclei segmentation and classification tasks across multiple organs. The dataset's specific scale, authorship, and creation date are not provided in the available metadata.
MoNuSAC_pred likely contains model predictions for nuclei segmentation and classification in histopathology images. The dataset is hosted on Kaggle, but its exact size, creation date, and author are unspecified. Columns and sample data are unavailable, making a detailed assessment impossible without download.
Irradiance measurements from a scaled-down bifacial photovoltaic array test-bed, featuring 2 front-facing and 4 rear-facing sensors. High-quality weather-station data is also available. The data was collected by the Department of Energy.