Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Instruction-output pairs for training base agent policies in interactive decision-making environments. The dataset was created by author Jianwen for the SkillRL research paper and was last updated on 2026-04-12. It contains examples with structured instructions and corresponding expert actions.
License is unknown; users should verify licensing terms before use.