Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
7,486 prompt-completion pairs for fine-tuning language models, derived from the SkillRL paper on evolving agents via recursive skill-augmented reinforcement learning. The dataset contains 237 distinct tasks and was authored by yananchen, with a last recorded update in April 2026. All rows contain actions within the admissible action list, indicating a focus on valid agent behavior.
License is unknown, which may restrict usage.