Cosmos-Reason1-RL: Video and Text Annotations for Embodied AI Reasoning

Name: Cosmos-Reason1-RL: Video and Text Annotations for Embodied AI Reasoning
Creator: nvidia
Published: 2025-05-16T22:33:34
Keywords: Benchmark, Robotics, Video Text Pairs, Multimodal, Reasoning Tasks

Description

Released by NVIDIA in May 2025, this multimodal dataset pairs videos with text annotations for embodied reasoning tasks. It draws on six source datasets: BridgeData V2, RoboVQA, Agibot, HoloAssist, AV, and RoboFail. The annotations are structured for supervised fine-tuning (SFT), reinforcement learning (RL), and benchmarking.

Use Cases

Training reinforcement learning agents for robotics based on video-text pairs.
Benchmarking embodied reasoning models on tasks from multiple source datasets.
Conducting supervised fine-tuning for vision-language models on embodied tasks.
Analyzing failure modes in robotic systems using the RoboFail benchmark data.

Strengths

Multimodal structure pairs video with text annotations, which is a key format for embodied AI.
Integrates data from six named source datasets, giving varied task coverage.
Released by NVIDIA, a leading institution in AI research and hardware.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count, file formats, and exact data size are unknown, limiting suitability assessment.
The dataset description refers to tables in an accompanying paper that is not reproduced here, so full context requires reading the paper.
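
Because field semantics, file formats, and sizes are undocumented, a first pass after download is usually a local inventory. The following is a minimal sketch of that step, not a documented workflow for this dataset: it assumes the files already sit in a local directory and that any annotation files are JSON holding either one object or a list of objects. The function names are hypothetical helpers introduced here for illustration.

```python
import json
from collections import Counter
from pathlib import Path


def summarize_download(root):
    """Count files and total bytes per file extension under a local directory."""
    counts, sizes = Counter(), Counter()
    for path in Path(root).rglob("*"):
        if path.is_file():
            # Normalize case so ".MP4" and ".mp4" are grouped together.
            ext = path.suffix.lower() or "(none)"
            counts[ext] += 1
            sizes[ext] += path.stat().st_size
    return {ext: {"files": counts[ext], "bytes": sizes[ext]} for ext in counts}


def infer_json_fields(path):
    """Map each top-level field name to the Python type names observed for it.

    Assumption: the file contains one JSON object or a list of JSON objects.
    Non-object entries are skipped.
    """
    with open(path, encoding="utf-8") as fh:
        data = json.load(fh)
    records = data if isinstance(data, list) else [data]
    fields = {}
    for record in records:
        if isinstance(record, dict):
            for key, value in record.items():
                fields.setdefault(key, set()).add(type(value).__name__)
    return fields
```

Calling summarize_download on the download directory gives per-extension file counts and sizes, and infer_json_fields on a single annotation file lists the fields it contains. Neither result replaces column-level documentation; both only narrow down what has to be checked against the paper.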

Provenance

Source: NVIDIA
Collection Method: Aggregated and annotated from multiple existing robotics and embodied AI datasets.
Freshness: Last updated 2025-05-20 06:51:06.

License is unknown; users must verify terms of use before downloading.

Quality Score

Overall: D37
Description: 42
Source: 36
Reputation: 36
Access: 26

Community

141 downloads

15 likes

0 views

Dataset Info

Author: nvidia
Created: May 16, 2025
Updated: May 20, 2025
Last synced: May 30, 2026
