Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
3,625 training and 200 validation examples engineered for Reinforcement Learning with Verifiable Rewards (RLVR). The dataset, created by guox18, contains two complementary synthetic datasets with different synthesis approaches and difficulty distributions. It was last updated on August 8, 2025.
License is unknown; terms of use must be verified.