VisionFoundry-10K provides 10,000 synthetic image-question-answer triples across 10 vision-centric tasks, released by TheMartyr in 2026. The data comes from a three-stage pipeline: an LLM generates prompts, a text-to-image model synthesizes the images, and a multimodal verifier filters the results for alignment.
The dataset is provided in Parquet format and was generated using the VisionFoundry pipeline.
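The generate, synthesize, verify flow described above can be sketched roughly as follows. This is an illustrative outline only, not the VisionFoundry implementation: the `Triple` fields, the `build_triples` function, and the stub `fake_synthesize` and `fake_verify` callables are all hypothetical names chosen for the example, and the real models are replaced by trivial stand-ins.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Callable, Iterable, Iterator


@dataclass(frozen=True)
class Triple:
    """One image-question-answer sample (field names are illustrative)."""
    image: bytes
    question: str
    answer: str


def build_triples(
    specs: Iterable[tuple[str, str, str]],
    synthesize: Callable[[str], bytes],
    verify: Callable[[bytes, str, str], bool],
) -> Iterator[Triple]:
    """Yield only the samples that the verifier accepts.

    specs      -- (image_prompt, question, answer) tuples, standing in for LLM output
    synthesize -- stand-in for a text-to-image model
    verify     -- stand-in for a multimodal verifier
    """
    for image_prompt, question, answer in specs:
        image = synthesize(image_prompt)
        if verify(image, question, answer):
            yield Triple(image, question, answer)


# Stub models so the sketch runs without any external dependency.
def fake_synthesize(prompt: str) -> bytes:
    # Pretend the "image" is just the encoded prompt.
    return prompt.encode("utf-8")


def fake_verify(image: bytes, question: str, answer: str) -> bool:
    # Accept a sample only if the answer text appears in the fake image.
    return answer.encode("utf-8") in image


specs = [
    ("a red cube left of a blue sphere", "What color is the cube?", "red"),
    ("two cats on a sofa", "How many dogs are there?", "three"),
]
kept = list(build_triples(specs, fake_synthesize, fake_verify))
```

With these stubs the second sample is dropped because its answer is not supported by the generated "image", which mirrors the role the source assigns to the verifier stage.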