DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Reinforcement Learning Datasets | DataSalon

All Categories

🎮

Reinforcement Learning

Offline RL trajectories, game data, robot demonstrations, RLHF, multi-agent interaction

10,046 datasets

Farrago Confusanearum: A Collection of Dissenting Opinions on the Lord's Supper

A Latin text collection titled 'Farrago confusanearum et inter se dissidentium opinionum de Coena Domini:ex sacramentarioru[m] libris congesta'. The dataset is hosted on the paperswithcode platform, which primarily aggregates resources for computer science and machine learning. The original author, organization, and creation date are unknown.

TextHistorical TextReligious DebateComputer ScienceTheologyLatin Text+1

0 views

Reinforcement Learning

Construction and Demolition Waste Analysis for Circular Economy

Construction and demolition waste: challenges and opportunities in a circular economy is a dataset from paperswithcode. The dataset likely contains research or analysis related to waste management in the construction sector. Its specific content, size, and authorship are not detailed in the provided metadata.

TextEcologyEngineeringCivil EngineeringCircular EconomyBusinessDemolitionDemolition WasteConstruction Waste+1

0 views

Reinforcement Learning

Transactions of the Cambridge Bibliographical Society Publications

A collection of publications from the Cambridge Bibliographical Society, aggregated on the Papers with Code platform. The dataset's specific content, size, and structure are not detailed in the available metadata. It is listed under a closed license, and the original author and organization are unknown.

TextLibrary ScienceHistoryComputer ScienceClassicsBibliographyAcademic Publications+1

0 views

Reinforcement Learning

Transactions of the Historical Society of Ghana, Academic Journal Collection

Transactions of the Historical Society of Ghana is a collection of academic journal articles. The dataset is aggregated from the paperswithcode platform, which suggests a focus on historical and geographical research related to Ghana. Specific details on volume, authorship, and publication dates are not provided in the metadata.

TextHistorical TextGhana HistoryGeographyAcademic Publications+1

0 views

Reinforcement Learning

UN Special Rapporteur Report on Violence Against Women

A report from the United Nations Special Rapporteur on violence against women, its causes and consequences. The dataset likely contains textual analysis and findings on this human rights issue. It is hosted on the Papers with Code platform.

TextHuman RightsViolence Against WomenPsychologyCriminologyPolitical Science+1

0 views

Reinforcement Learning

IASC Guidelines on Mental Health and Psychosocial Support in Emergency Settings

A document titled 'IASC Guidelines on Mental Health and Psychosocial Support in Emergency Settings' authored by Elizabeth Carll and hosted on the paperswithcode platform. The content likely provides structured recommendations and frameworks for mental health interventions in crisis situations. The dataset's specific format, size, and update history are not provided in the metadata.

TextMental HealthGuidelinesPsychologyPsychosocial SupportHealthcarePsychosocialEmergency ResponsePsychiatry+1

0 views

Reinforcement Learning

Opportunity Insights Economic Tracker for the United States

Timely and granular datasets on consumer spending, job openings, and other economic indicators. The data is provided by Opportunity Insights. The dataset likely contains high-frequency economic metrics.

TabularTime SeriesCovid19EconomicsEconomic TrackerFinanceBusinessConsumer SpendingJob Openings+1

0 views

Reinforcement Learning

Fraud Detection Data with 1 Million Transactions and 7 Fraud Types

One million financial transactions labeled with seven distinct fraud types. The description suggests the data includes features for fraud rings and behavioral patterns, and notes the presence of class imbalance. The dataset is hosted on Kaggle, but author, organization, and license details are unknown.

TabularEconomicsBankingInsuranceFinanceTransaction DataInvestingFraud DetectionClass Imbalance+1

0 views

Reinforcement Learning

Reproductive Health Economics in Accra, Ghana

A PopPov Research Brief authored by Mahesh Karra examines the economics of reproductive health in Accra, Ghana. The dataset likely contains socioeconomic and health indicators related to fertility, family planning, and earnings. Specific details on data volume, temporal coverage, and collection methodology are not provided in the input.

TabularMedicineSocioeconomic StatusEnvironmental HealthEarningsDeveloping CountryReproductive HealthFertilityFamily PlanningEconomicsHealthcareResearch MethodologyPopulationBusinessEconomic GrowthSocioeconomicsPublic Health+1

0 views

Reinforcement Learning

French Real Estate Transaction Records

A preprocessed collection of real estate transaction data from France. The dataset is intended for analysis of the French housing market, supporting tasks like visualization and exploratory data analysis. Specific details on the number of records and features are not provided.

HousingData VisualizationExploratory Data AnalysisData StorytellingReal Estate+1

0 views

Reinforcement Learning

Nemotron-Cascade-RM-Training: 81,808 Prompts for Reward Model Development

81,808 samples of prompts and associated metadata form this dataset designed for training reward models in reinforcement learning from human feedback (RLHF). Created by NVIDIA, this collection is a curated subset from multiple sources and was last updated in December 2025. The dataset is explicitly noted as ready for commercial use.

TextRlhfPreference LearningPrompt EngineeringReward Model+1

0 views

Reinforcement Learning

Reward Model Training Prompts for RLHF

NVIDIA's Nemotron-Cascade-RM-Training dataset provides 81,808 samples for training reward models in reinforcement learning from human feedback (RLHF). It contains prompts, data sources, and category information. The dataset was published by NVIDIA in December 2025.

TextRlhfPreference LearningPrompt EngineeringReward Model+1

0 views

Reinforcement Learning

Embeddings for a Corpus of Sacred Texts from Multiple Traditions

A Kaggle dataset providing vector embeddings for a collection of sacred texts. The corpus likely spans multiple religious or spiritual traditions, enabling computational analysis. The specific texts, embedding model, and dataset scale are not detailed in the available metadata.

TextReligious StudiesNatural Language ProcessingText EmbeddingsSacred Texts+1

0 views

Reinforcement Learning

Master Support Indices

Master_support_indices.json is a dataset published on Kaggle. The title suggests it likely contains numerical indices or metrics related to support systems or performance. Its specific content, scale, and authorship are unknown from the provided metadata.

TabularSupport IndicesKagglePerformance Metrics+1

0 views

Reinforcement Learning

Spanish Technical Support Tickets, 20K Records

20,000 technical support tickets written in Spanish, sourced from Kaggle. The dataset is likely intended for natural language processing tasks. Its specific origin, creation date, and detailed structure are not provided in the available metadata.

TextCustomer ServiceSpanish-languageText ClassificationTechnical SupportText Data+1

0 views

Reinforcement Learning

Flippo Image Dataset

Kaggle hosts a dataset titled 'flippo_img_dataset'. The dataset likely contains images, as suggested by its title. No further details on size, creator, or update date are available.

ImageMachine LearningComputer Vision+1

0 views

Reinforcement Learning

VICIdial Asterisk Call Center Logs

VICIdial/Asterisk data provides telephony logs and metrics from call center operations. The dataset's volume, creator, and temporal coverage are unspecified. It originates from the open-source VICIdial call center software platform.

TabularCustomer ServiceVoipCall CenterTelephony+1

0 views

Reinforcement Learning

X2Edit: Image Editing Dataset for 14 Diverse Tasks

The X2Edit Dataset is an image editing collection covering 14 diverse tasks, developed by OPPOer and hosted on Hugging Face. It was last updated on December 30, 2025. The dataset description claims it exhibits advantages over several existing open-source image editing datasets.

ImageMultimodalWEBDATASETGenerative AiLibrarywebdatasetSize Categories10 Mn100 MModalitytextLibrarymlcroissantModalityimageLibrarydatasetsComputer VisionArxiv250807607RegionusLicenseapache 20+1

0 views

Reinforcement Learning

Combined Reasoning and Thinking Dataset for RL and SFT Training

comoZ's Reasoning Dataset is a compiled collection for training reasoning models, containing RL and SFT subsets. The RL subset provides high-quality ground truth pairs with task_type and rubrics for reward modeling. The SFT subset offers instruction-following data with tags to model thinking processes.

TextEnglishMathematicsComputer ProgrammingReinforcement Learning+1

0 views

Reinforcement Learning

Supporting Data for Nonreciprocal Coulomb Drag in Electron-Hole Bilayers

Codes, figures, and other data supporting the results of a physics paper on nonreciprocal perfect Coulomb drag and coherent exciton superflow in electron-hole bilayers. The data was authored by Jun-Xiao Hui and is hosted by Harvard Dataverse. The specific number of rows, columns, and file formats is unknown.

Physics+1

0 views

PreviousPage 415 of 501Next