This dataset features chunked content from 12 open-source mathematics textbooks, including 'An Infinitely Large Napkin' and 'Mathematical Reasoning: Writing and Proof'. It is intended for research on retrieval-augmented generation, embedding, and math reasoning. The source code for the data pipeline is publicly available on GitHub.
The textbooks are released under different open-source licenses (e.g., CC BY-SA 4.0, CC BY-NC-SA 3.0); users must comply with the license terms of each individual book. The dataset page on Hugging Face may contain a more complete book list.