DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

OpenThoughts-Agent-v1-RL: 720 Tasks with Verifiers for Agentic RL | DataSalon

Home Multimodal & LLMOpenThoughts-Agent-v1-RL: 720 Tasks with Verifiers for Agentic RL

Multimodal & LLM

OpenThoughts-Agent-v1-RL: 720 Tasks with Verifiers for Agentic RL

Name: OpenThoughts-Agent-v1-RL: 720 Tasks with Verifiers for Agentic RL
Creator: open-thoughts
Published: 2025-12-05T00:17:59
Keywords: Librarypolars, Size Categoriesn1 K, Modalitytext, Librarymlcroissant, Librarydatasets, Librarypandas, Parquet, Regionus

by open-thoughts·Updated 5mo ago

Available on 1 platform

Description

OpenThoughts-Agent-v1-RL provides approximately 720 curated reinforcement learning tasks designed for training agentic models, released by the open-thoughts project in January 2026. The collection includes instructions, environment configurations, and verifiers specifically optimized for benchmarks like Terminal-Bench 2.0 and SWE-Bench.

Use Cases

Training agents to execute terminal commands using the environment and verifier components
Fine-tuning models for software engineering automation based on SWE-Bench style instructions
Developing reward functions for RLHF using the success signals from the verifier fields

Strengths

Contains ~720 curated tasks
Includes integrated verifiers for automated performance evaluation
Optimized for Terminal-Bench 2.0 and SWE-Bench standards

Limitations

Small sample size of approximately 720 records
Limited to text-based agentic modalities

Provenance

Source: open-thoughts project
Collection Method: curated
Freshness: Last updated January 2026.

Distributed in Parquet format; requires environment-compatible runners to utilize the included verifiers and environment configurations.

Parquet Librarypolars Size Categoriesn1 K Modalitytext Librarymlcroissant Librarydatasets Librarypandas Regionus

Related Datasets

Quality Score

D39

Description

Source

Reputation

Quality Score

D39

Description

Source

Reputation

Access

Community

325 downloads

12 likes

0 views

Dataset Info

Author: open-thoughts
Created: Dec 5, 2025
Updated: Jan 27, 2026
Last synced: Jun 5, 2026

Access

Community

325 downloads

12 likes

0 views

Dataset Info

Author: open-thoughts
Created: Dec 5, 2025
Updated: Jan 27, 2026
Last synced: Jun 5, 2026

OpenThoughts-Agent-v1-RL: 720 Tasks with Verifiers for Agentic RL

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info