Available on 1 platform
VDC-team created DialoguesEN-50k-Synthesis-Code, a dataset of 50,000 simple English dialogues synthesized by Python code. The dialogues are built from semantic blocks arranged semi-randomly by a generation algorithm and are intended for pretraining small language models. The dataset was last updated on 2026-06-18.
The license is unknown; verify the terms of use before using the dataset.
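To make the idea of dialogues "built from semantic blocks arranged semi-randomly" more concrete, here is a minimal Python sketch of one way such a generator could work. It is not the VDC-team's actual code, which is not shown here. The block names, the example sentences, and the functions `generate_dialogue` and `generate_dataset` are all hypothetical, and the rule of a fixed opening and closing with a shuffled middle is an assumption about what "semi-random" might mean.

```python
import random

# Hypothetical semantic blocks: each block name maps to interchangeable
# two-speaker exchanges. These are illustrative, not taken from the dataset.
BLOCKS = {
    "greeting": [("Hello!", "Hi there!"),
                 ("Good morning.", "Good morning to you.")],
    "weather": [("Is it cold outside?", "Yes, it is very cold."),
                ("Is it raining?", "No, the sun is out.")],
    "plans": [("What will you do today?", "I will read a book."),
              ("Are you busy today?", "Yes, I have to work.")],
    "farewell": [("Goodbye!", "See you later!"),
                 ("I have to go now.", "Okay, take care.")],
}

FIXED = ("greeting", "farewell")


def generate_dialogue(rng, min_middle=1, max_middle=2):
    """Build one dialogue: fixed opening and closing, shuffled middle blocks."""
    middle = [name for name in BLOCKS if name not in FIXED]
    count = rng.randint(min_middle, min(max_middle, len(middle)))
    # rng.sample picks `count` distinct middle blocks in random order.
    order = ["greeting"] + rng.sample(middle, count) + ["farewell"]
    lines = []
    for name in order:
        first, second = rng.choice(BLOCKS[name])
        lines.append(f"A: {first}")
        lines.append(f"B: {second}")
    return lines


def generate_dataset(n, seed=0):
    """Generate n dialogues reproducibly from a seeded generator."""
    rng = random.Random(seed)
    return [generate_dialogue(rng) for _ in range(n)]


if __name__ == "__main__":
    for dialogue in generate_dataset(2, seed=1):
        print("\n".join(dialogue))
        print()
```

Seeding a dedicated `random.Random` instance keeps the output reproducible, which is convenient when a synthetic corpus needs to be regenerated or audited. A real generator at the 50,000-dialogue scale would presumably need far more blocks and templates than this toy example.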