Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Training performance data for the MiroMind-M1-RL-7B model on the AIME24 and AIME25 benchmarks. The dataset is associated with a model trained via reinforcement learning with verifiable rewards on 62,000 challenging examples. It is authored by miromind-ai and was last updated in July 2025.
The primary dataset description is a brief overview; users must visit the provided Hugging Face page for comprehensive details on data structure and access.