Tmax 15K Open Instruct is a collection of approximately 15,000 reinforcement learning environment instances used to train the Tmax 9B and other Tmax models. The dataset was created by AllenAI and is formatted for use with their open-instruct fork. Details on its generation and composition are available in the associated research paper.
The license is unknown; verify the terms of use before using the dataset.