Available on 1 platform
This dataset contains raw LaTeX source archives for 150,334 ArXiv papers covering major AI/ML conferences from 2016 to 2026. The collection totals approximately 285 GB and is categorized by ArXiv subject codes such as cs.LG and cs.CV. The dataset was compiled by Vidushee and last updated on March 24, 2026.
The license is unknown; users must verify the terms of use for ArXiv data. At ~285 GB, with one .tar.gz archive per paper, the collection requires significant storage and processing capacity.
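
Because each paper ships as its own .tar.gz archive, one way to limit the storage overhead is to read the LaTeX files directly from an archive without unpacking it to disk. The following is a minimal sketch using only the Python standard library. The function names `iter_tex_sources` and `build_demo_archive` are hypothetical, the internal layout of the real archives is an assumption (the listing does not describe it), and the demo archive is synthetic rather than taken from the dataset.

```python
import io
import tarfile


def iter_tex_sources(archive, encoding="utf-8"):
    """Yield (member_name, text) for each .tex file in a .tar.gz archive.

    `archive` is a binary file object. Nothing is extracted to disk.
    Assumption: the archive holds plain .tex files; the real layout may differ.
    """
    with tarfile.open(fileobj=archive, mode="r:gz") as tar:
        for member in tar:
            if not member.isfile():
                continue
            if not member.name.lower().endswith(".tex"):
                continue
            handle = tar.extractfile(member)
            if handle is None:
                continue
            # Undecodable bytes are replaced rather than raising an error.
            yield member.name, handle.read().decode(encoding, errors="replace")


def build_demo_archive():
    """Build a tiny in-memory .tar.gz standing in for one paper's archive."""
    buffer = io.BytesIO()
    files = [
        ("main.tex", b"\\documentclass{article}"),
        ("fig.png", b"\x89PNG"),
    ]
    with tarfile.open(fileobj=buffer, mode="w:gz") as tar:
        for name, body in files:
            info = tarfile.TarInfo(name=name)
            info.size = len(body)
            tar.addfile(info, io.BytesIO(body))
    buffer.seek(0)
    return buffer


if __name__ == "__main__":
    for name, text in iter_tex_sources(build_demo_archive()):
        print(name, len(text))
```

For an archive on disk, the same function can be called with `open(path, "rb")` as the `archive` argument, where `path` is whatever location the download was saved to.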