In October 2025, HuggingFaceM4 released FineVision, a collection of 24.3 million samples comprising 17.3 million images and 88.9 million conversational turns. The dataset is designed for training open Vision-Language Models and contains 9.5 billion answer tokens.
The data is accessed through the Hugging Face datasets library. Because the collection is organized into subsets, call get_dataset_config_names first to list them, then load only the subsets you need.
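The list-then-load workflow can be sketched roughly as follows. This is a minimal illustration, not the maintainers' recommended recipe: the repository ID "HuggingFaceM4/FineVision" and the "train" split are assumptions not stated above, and pick_subsets is a hypothetical helper added only to show filtering the returned names. get_dataset_config_names and load_dataset are the real entry points of the datasets library.

```python
from typing import Iterable, List

# Assumed Hub repository ID; the text above does not state it.
REPO_ID = "HuggingFaceM4/FineVision"


def pick_subsets(config_names: Iterable[str], keyword: str) -> List[str]:
    """Hypothetical helper: keep config names containing `keyword` (case-insensitive)."""
    needle = keyword.lower()
    return sorted(name for name in config_names if needle in name.lower())


def list_subsets(repo_id: str = REPO_ID) -> List[str]:
    """Fetch the subset (config) names for a Hub dataset. Requires network access."""
    from datasets import get_dataset_config_names  # pip install datasets

    return get_dataset_config_names(repo_id)


def stream_subset(name: str, repo_id: str = REPO_ID, split: str = "train"):
    """Open one subset in streaming mode, so samples are fetched lazily.

    The "train" split name is an assumption; check the dataset card if it fails.
    """
    from datasets import load_dataset

    return load_dataset(repo_id, name=name, split=split, streaming=True)
```

A typical session would call list_subsets(), narrow the result with pick_subsets(names, "chart") or a similar keyword, and pass one of the surviving names to stream_subset. Streaming is used here because a collection of this size is expensive to download in full; iterating with next(iter(ds)) fetches a single sample for inspection.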