Available on 1 platform
200,000 text samples aggregated from nine public datasets, including HuggingFaceTB/cosmopedia-v2 (32.15%) and teknium/OpenHermes-2.5 (29.62%). The dataset was created by ethicalabs and last updated on March 16, 2026. It appears to be a curated collection for supervised fine-tuning of language models.
License is unknown; users must verify the license of each source dataset before use.
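To make the stated shares concrete, the percentages above can be converted into approximate sample counts. This is a minimal sketch that assumes the percentages are shares of the 200,000 total samples, which the description does not state explicitly; the names `TOTAL_SAMPLES`, `SHARES`, and `approx_count` are hypothetical and introduced here for illustration only.

```python
# Assumption: each percentage is a share of the 200,000 total samples.
TOTAL_SAMPLES = 200_000

# Shares quoted in the description; the other seven sources are not itemized.
SHARES = {
    "HuggingFaceTB/cosmopedia-v2": 32.15,
    "teknium/OpenHermes-2.5": 29.62,
}


def approx_count(share_percent: float, total: int = TOTAL_SAMPLES) -> int:
    """Convert a percentage share into an approximate number of samples."""
    return round(total * share_percent / 100)


counts = {name: approx_count(share) for name, share in SHARES.items()}

# Samples left over for the seven sources not named in the description.
remaining = TOTAL_SAMPLES - sum(counts.values())

for name, count in counts.items():
    print(f"{name}: ~{count:,} samples")
print(f"Other seven sources combined: ~{remaining:,} samples")
```

Under that assumption, the two named sources account for roughly 64,300 and 59,240 samples, leaving about 76,460 samples spread across the remaining seven datasets.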