Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
comoZ's Reasoning Dataset is a compiled collection for training reasoning models, containing RL and SFT subsets. The RL subset provides high-quality ground truth pairs with task_type and rubrics for reward modeling. The SFT subset offers instruction-following data with tags to model thinking processes.
The full description is hosted externally; users must visit the provided URL for complete details. License information is unknown.