Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
NVIDIA released this collection of approximately 9 million vision-language samples in late 2025. It focuses on document understanding, visual question answering, and video-to-text tasks across multiple languages.
Licensed under CC-BY-4.0; users should note the November 2025 fix for broken <think> entries in the nights_cot subset.