DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

VideoNet: Benchmark for Domain-Specific Action Recognition and In-Context Video Learning | DataSalon

Home Computer VisionVideoNet: Benchmark for Domain-Specific Action Recognition and In-Context Video Learning

Computer Vision

VideoNet: Benchmark for Domain-Specific Action Recognition and In-Context Video Learning

Name: VideoNet: Benchmark for Domain-Specific Action Recognition and In-Context Video Learning
Creator: raivn
Published: 2026-04-24T05:04:52
Keywords: Video Action Recognition, Vision Language Models, Benchmark, Computer Vision, Video, Multimodal

by raivn·Updated 1mo ago

Available on 1 platform

Description

VideoNet is a dataset highlighted at CVPR 2026 for studying domain-specific action recognition and in-context video learning in Vision-Language Models (VLMs). The dataset includes benchmark MP4 files and JSONL files containing question-and-answer pairs. It was uploaded by author 'raivn' and last updated on May 6, 2026.

Use Cases

Benchmarking Vision-Language Models on domain-specific action recognition tasks based on the provided video and Q&A pairs.
Training models for in-context video learning based on the structured benchmark JSONL files.
Studying the performance of VLMs on video-based question answering based on the dataset's described structure.

Strengths

Dataset is associated with a CVPR 2026 Highlight paper, indicating academic relevance.
Includes both video files (MP4s) and structured benchmark files (JSONLs) for evaluation.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count, file formats beyond MP4/JSONL, and license information are unknown.
Description metadata is limited; actual data quality requires manual inspection after download.

Provenance

Source: huggingface
Freshness: Last updated 2026-05-06 07:52:08; freshness should be verified.

Video Multimodal Video Action Recognition Vision Language Models Benchmark Computer Vision

Related Datasets

Quality Score

D37

Description

Source

Reputation

Quality Score

D37

Description

Source

Reputation

Access

Community

215 downloads

1 likes

0 views

Dataset Info

Author: raivn
Created: Apr 24, 2026
Updated: May 6, 2026
Last synced: Jun 16, 2026

Access

Community

215 downloads

1 likes

0 views

Dataset Info

Author: raivn
Created: Apr 24, 2026
Updated: May 6, 2026
Last synced: Jun 16, 2026

VideoNet: Benchmark for Domain-Specific Action Recognition and In-Context Video Learning

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info