Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
OpenThoughts-Agent-v1-RL provides approximately 720 curated reinforcement learning tasks designed for training agentic models, released by the open-thoughts project in January 2026. The collection includes instructions, environment configurations, and verifiers specifically optimized for benchmarks like Terminal-Bench 2.0 and SWE-Bench.
Distributed in Parquet format; requires environment-compatible runners to utilize the included verifiers and environment configurations.