Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
PLM-Video-Human is a collection of human-annotated resources for training Vision Language Models, focused on detailed video understanding. The dataset, created by Facebook, was last updated on May 21, -2025. Training tasks include fine-grained open-ended question answering, region-based video captioning, dense captioning, and temporal localization.
The full dataset description is hosted externally; users must visit the provided URL for complete structural details.