Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A curated collection of five established code instruction datasets formatted for LLM training. The datasets, including Magicoder-OSS-Instruct-75K and glaive-code-assistant-v3, have been processed into the LLAMA chat format with markdown for code snippets. It was created by MaLA-LM and last updated in July 2024.
License information for the aggregated dataset is unknown; users should verify the licenses of the original source datasets.