DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

TVR: Large-Scale Video-Subtitle Moment Retrieval with Multimodal Queries | DataSalon

Home Multimodal & LLMTVR: Large-Scale Video-Subtitle Moment Retrieval with Multimodal Queries

Multimodal & LLM

TVR: Large-Scale Video-Subtitle Moment Retrieval with Multimodal Queries

Name: TVR: Large-Scale Video-Subtitle Moment Retrieval with Multimodal Queries
Creator: jayleicn
Published: 2020-01-27T01:41:06
License: MIT
Keywords: Pytorch, Tvr, Tvc, Video Retrieval

by jayleicn·Updated 2y ago

Available on 1 platform

Description

TVR provides video-subtitle pairs and natural language queries for temporal moment retrieval, introduced by Jie Lei at ECCV 2020. The collection focuses on the TV show domain, requiring models to utilize both visual and textual dialogue features to locate specific events.

Use Cases

Temporal moment localization using natural language queries and subtitle timestamps
Cross-modal retrieval by matching text descriptions to visual features and dialogue
Multimodal representation learning using video-subtitle pairs

Strengths

ECCV 2020 peer-reviewed source
Includes both visual frames and synchronized subtitle text
MIT licensed for open research

Limitations

Restricted to TV show content which may not generalize to other video domains
Requires significant storage and GPU resources for video feature extraction

Provenance

Source: ECCV 2020 paper 'TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval'
Collection Method: Human annotation of TV show clips and subtitles
Freshness: Last updated May 2024

The repository provides PyTorch code for the XML (Cross-modal Moment Localization) model; users should be prepared for high-dimensional video feature processing.

Pytorch Tvr Tvc Video Retrieval

Related Datasets

Quality Score

D25

Description

Source

Reputation

Quality Score

D25

Description

Source

Reputation

Access

Community

161 likes

0 views

Dataset Info

License: MIT
Author: jayleicn
Created: Jan 27, 2020
Updated: May 28, 2024
Language: Python
Last synced: Jun 2, 2026

Access

Community

161 likes

0 views

Dataset Info

License: MIT
Author: jayleicn
Created: Jan 27, 2020
Updated: May 28, 2024
Language: Python
Last synced: Jun 2, 2026

TVR: Large-Scale Video-Subtitle Moment Retrieval with Multimodal Queries

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info