OpenDataArena published ODA-Fin-RL-12K in March 2026, providing 12,187 hard-but-verifiable samples for reinforcement learning in the financial domain. The dataset focuses on complex reasoning tasks with concise answers optimized for automated reward modeling and distillation.
The dataset is released under the Apache 2.0 license and is intended as a reinforcement learning complement to the ODA-Fin-SFT-318K dataset.
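To illustrate why concise answers suit automated reward modeling, here is a minimal sketch of a rule-based verifiable reward. It is not taken from the dataset or its authors: the function names `normalize` and `verifiable_reward`, the normalization rules (dropping commas, currency and percent signs), and the tolerance are all assumptions chosen for illustration, and the dataset's actual answer format may differ.

```python
import re


def normalize(ans: str) -> str:
    """Lowercase the answer and drop whitespace, commas, '$' and '%' (assumed rules)."""
    return re.sub(r"[,$%\s]", "", ans.strip().lower())


def verifiable_reward(prediction: str, reference: str, rel_tol: float = 1e-4) -> float:
    """Return 1.0 if the prediction matches the reference, else 0.0.

    Numeric answers are compared within a relative tolerance;
    anything else falls back to exact string match after normalization.
    """
    p, r = normalize(prediction), normalize(reference)
    try:
        pf, rf = float(p), float(r)
    except ValueError:
        # At least one side is not numeric: compare as normalized strings.
        return 1.0 if p == r else 0.0
    return 1.0 if abs(pf - rf) <= rel_tol * max(1.0, abs(rf)) else 0.0
```

A binary reward like this only works when the reference answer is short enough to compare mechanically, which is the property the description above attributes to the samples.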