Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Tiny Codes is a synthetic dataset containing 1.6 million short code snippets designed to help large language models reason with natural and programming languages. It covers a wide range of languages including Python, JavaScript, Rust, C++, SQL, and Cypher, created by nampdn-ai and last updated in September 2023.
The dataset is described as synthetic; users should validate its applicability for tasks requiring real-world code. The license is tagged as MIT but should be verified on the dataset page.