Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A collection of source code samples from multiple programming languages, created by author mesolitica and last updated on June 1, 2025. The dataset was generated using the 'Magicoder: Source Code Is All You Need' template, targeting at least 10,000 rows per language. It includes samples for languages such as C++, C#, CUDA, and Dockerfile, sourced from the deduplicated version of The Stack dataset.
License is listed as 'mit' in platform tags, but the specific terms should be verified on the dataset page.