AllenAI released a collection of roughly 15,000 reinforcement learning environment instances used to train the Tmax 9B and other Tmax language models. The dataset was published on Hugging Face and last updated on June 23, 2026. Specific details on generation and composition are documented in the associated research paper.
The license is unknown; verify the terms of use before downloading.