Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
PaTaRM-data is the training data for the PaTaRM series, a collection for aligning language models. The dataset contains two subsets: one with 35.6k samples for supervised fine-tuning (SFT) and another with 41.7k samples for reinforcement learning (RL). It was created by AIJian and last updated on April 1, 2026.
For full details including data format and usage examples, users must refer to the main collection README linked in the description.