Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A benchmark for video question answering and temporal grounding, likely sourced from the NExT-GQA dataset. The dataset was created by Kwai-Keye and last updated on Hugging Face on 2026-05-20. It is designed to evaluate models that perform on-demand temporal grounding to locate relevant video segments before answering questions.
License is unknown; users must verify licensing terms before use.