Available on 1 platform
The Speech2LaTeX dataset contains 66,000 human-annotated audio samples of spoken mathematical equations and sentences in English and Russian. It is the first fully open-source large-scale dataset for converting spoken math to LaTeX, with samples drawn from diverse scientific domains. The dataset was created by marsianin500 and last updated on November 16, 2025.
The license is unknown; users should verify the terms before use.