DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

EgoAVU: Egocentric Audio-Visual Understanding Data | DataSalon

Home Media & CommunicationEgoAVU: Egocentric Audio-Visual Understanding Data

Media & Communication

EgoAVU: Egocentric Audio-Visual Understanding Data

Name: EgoAVU: Egocentric Audio-Visual Understanding Data
Creator: facebook
Published: 2026-01-09T19:47:07
Keywords: Audio Visual, Multimodal Learning, Audio, Egocentric Vision, Human Actions, Multimodal

by facebook·Updated 2mo ago

Available on 1 platform

Description

EgoAVU is a dataset for egocentric audio-visual understanding, introduced by Facebook and highlighted at CVPR 2026. The data engine enriches existing egocentric narrations by integrating human actions with environmental context, linking visible objects and sounds. The dataset was last updated on the platform on April 9, 2026.

Use Cases

Train models for audio-visual scene understanding based on linked objects and sounds.
Develop systems for automated egocentric narration enrichment based on integrated action and context.
Benchmark algorithms for human activity recognition in first-person video.
Research multimodal fusion techniques for egocentric data streams.

Strengths

Dataset is associated with a CVPR 2026 highlight paper, suggesting academic relevance.
Data engine is described as 'scalable and automated' in its methodology.
Last updated on the platform on 2026-04-09.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count, file formats, and license are unknown, which may limit suitability assessment.

Provenance

Source: facebook
Collection Method: Created via a scalable, automated data engine that enriches existing egocentric narrations.
Time Range: null
Freshness: Last updated 2026-04-09 16:55:47; freshness should be verified.
Geography: null

null

Audio Multimodal Audio Visual Multimodal Learning Egocentric Vision Human Actions

Related Datasets

Quality Score

D38

Description

Source

Reputation

Quality Score

D38

Description

Source

Reputation

Access

Community

235 downloads

13 likes

0 views

Dataset Info

Author: facebook
Created: Jan 9, 2026
Updated: Apr 9, 2026
Last synced: May 2, 2026

Access

Community

235 downloads

13 likes

0 views

Dataset Info

Author: facebook
Created: Jan 9, 2026
Updated: Apr 9, 2026
Last synced: May 2, 2026

EgoAVU: Egocentric Audio-Visual Understanding Data

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info