TVQA+: Spatio-Temporal Grounding for Video Question Answering | DataSalon

Multimodal & LLM

Name: TVQA+: Spatio-Temporal Grounding for Video Question Answering
Creator: jayleicn
Published: 2019-04-22T19:45:26
License: MIT
Keywords: PyTorch, Video Question Answering, TVQA

Description

TVQA+ provides spatio-temporal grounding labels for video question answering. Published at ACL 2020, the dataset supports multimodal reasoning by linking natural-language questions to the specific video frames and bounding-box regions they refer to.

Use Cases

Training models for spatio-temporal grounding using bounding box annotations
Developing multi-modal QA systems that align subtitle text with visual frames
Benchmarking visual reasoning by localizing objects mentioned in question-answer pairs
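
Spatial grounding of the kind listed above is commonly scored by comparing a predicted box with an annotated one using intersection-over-union (IoU). The sketch below is a minimal illustration only: the `Box` type, its (left, top, width, height) layout, and the sample values are assumptions made for this example, not the dataset's documented schema or the authors' evaluation code.

```python
from typing import NamedTuple


class Box(NamedTuple):
    """Axis-aligned box in pixels (hypothetical layout assumed for this sketch)."""
    left: float
    top: float
    width: float
    height: float


def box_iou(a: Box, b: Box) -> float:
    """Return the intersection-over-union of two boxes, or 0.0 if they do not overlap."""
    # Width and height of the overlapping rectangle (non-positive means no overlap).
    inter_w = min(a.left + a.width, b.left + b.width) - max(a.left, b.left)
    inter_h = min(a.top + a.height, b.top + b.height) - max(a.top, b.top)
    if inter_w <= 0 or inter_h <= 0:
        return 0.0
    inter = inter_w * inter_h
    union = a.width * a.height + b.width * b.height - inter
    return inter / union


# Made-up example values: two 100x100 boxes offset horizontally by 50 pixels.
predicted = Box(left=10, top=10, width=100, height=100)
annotated = Box(left=60, top=10, width=100, height=100)
print(box_iou(predicted, annotated))
```

A prediction is typically counted as correct when its IoU with the annotated box exceeds a chosen threshold; the threshold itself is a design choice and is not specified on this page.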

Strengths

Provides spatio-temporal grounding via bounding box annotations
Integrates video frames with subtitle text and QA pairs
Peer-reviewed methodology from ACL 2020
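
Working with frames and subtitles together usually requires mapping each subtitle's time span onto the frames sampled within it. The following is a rough sketch under stated assumptions: the function name, the zero-based frame indexing, and the fixed sampling rate passed as `fps` are all hypothetical choices for illustration and are not taken from the dataset's documentation.

```python
import math
from typing import List


def frames_for_subtitle(start_s: float, end_s: float, fps: float) -> List[int]:
    """Return indices of frames whose timestamps fall within [start_s, end_s].

    Assumes frame i was sampled at time i / fps seconds (an assumption of
    this sketch, not a documented property of the dataset).
    """
    first = math.ceil(start_s * fps)   # earliest frame at or after the start
    last = math.floor(end_s * fps)     # latest frame at or before the end
    return list(range(first, last + 1))


# Made-up example: a subtitle shown from 1.0 s to 2.0 s, frames sampled at 3 per second.
print(frames_for_subtitle(1.0, 2.0, fps=3.0))
```

Very short subtitles can fall between two sampled frames and return an empty list, so a real pipeline would need a fallback such as picking the nearest frame.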

Limitations

Restricted to television show environments which may not generalize to other domains
Static dataset with no updates since October 2022

Provenance

Source: ACL 2020 Publication: TVQA+: Spatio-Temporal Grounding for Video Question Answering
Collection Method: Annotated from television show frames and subtitles
Freshness: Last updated in October 2022; static research dataset.

Requires PyTorch for the associated implementation; licensed under MIT.

Quality Score

D21

Community

133 likes

0 views

Dataset Info

License: MIT
Author: jayleicn
Created: Apr 22, 2019
Updated: Oct 25, 2022
Language: Python
Last synced: May 19, 2026
