The MMFace-DiT Dataset provides multimodal conditioning data for high-fidelity, controllable face synthesis. Created by BharathK333, it pairs spatial conditioning inputs such as masks and sketches with VLM-enriched semantic captions. It was accepted to CVPR 2026 and was last updated in April 2026.
The full dataset description and list of components are available on the Hugging Face dataset page. Key metadata, such as the license and exact file formats, is not stated here.