DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

VideoTemp Bench: Video Question Answering with Agentic Temporal Grounding | DataSalon

Home Computer VisionVideoTemp Bench: Video Question Answering with Agentic Temporal Grounding

Computer Vision

VideoTemp Bench: Video Question Answering with Agentic Temporal Grounding

Name: VideoTemp Bench: Video Question Answering with Agentic Temporal Grounding
Creator: Kwai-Keye
Published: 2026-05-18T07:49:59
Keywords: Question Answering, Computer Vision, Video Understanding, Time Series, Video, Temporal Grounding, Multimodal

by Kwai-Keye·Updated 1mo ago

Available on 1 platform

Description

A benchmark for video question answering and temporal grounding, likely sourced from the NExT-GQA dataset. The dataset was created by Kwai-Keye and last updated on Hugging Face on 2026-05-20. It is designed to evaluate models that perform on-demand temporal grounding to locate relevant video segments before answering questions.

Use Cases

Benchmarking video question-answering models based on the described agentic pipeline.
Evaluating temporal grounding algorithms based on the iterative refinement process mentioned.
Training models for multimodal reasoning that integrate visual evidence with natural language queries.

Strengths

Dataset is associated with a published research concept (VideoTemp-o3) for harmonizing temporal grounding and video understanding.
Last update timestamp is explicitly provided: 2026-05-20 12:40:21.

Limitations

Row count, column definitions, and file formats are unknown, which limits suitability assessment.
Description metadata is limited; actual data quality requires manual inspection after download.
Column-level documentation is absent; field semantics must be inferred after download.

Provenance

Source: NExT-GQA (https://github.com/doc-doc/NExT-GQA)
Freshness: Last updated 2026-05-20 12:40:21; freshness should be verified.

License is unknown; users must verify licensing terms before use.

Time Series Video Multimodal Question Answering Computer Vision Video Understanding Temporal Grounding

Related Datasets

Quality Score

D38

Description

Source

Reputation

Quality Score

D38

Description

Source

Reputation

Access

Community

62 downloads

1 likes

0 views

Dataset Info

Author: Kwai-Keye
Created: May 18, 2026
Updated: May 20, 2026
Last synced: Jun 8, 2026

Access

Community

62 downloads

1 likes

0 views

Dataset Info

Author: Kwai-Keye
Created: May 18, 2026
Updated: May 20, 2026
Last synced: Jun 8, 2026

VideoTemp Bench: Video Question Answering with Agentic Temporal Grounding

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info