Sign in to view source links and access this dataset
Description
NEXUS is a multi-modal, hierarchical, temporal representation derived from the HuggingFaceFV/finevideo dataset. The primary unit is a 10-millisecond 'slice' that aggregates into moments (100 ms), seconds (1 s), experiences (10 s), and minutes (60 s). It was created by Ardea and last updated on 2025-12-29.
Use Cases
Training models for temporal action recognition based on the hierarchical slice structure.
Developing neural architectures for streaming video processing based on the 10 ms slice unit.
Researching multimodal fusion techniques based on the dataset's multi-modal nature.
Exploring hierarchical representation learning based on the aggregation from slices to minutes.
Strengths
Hierarchical temporal structure with defined units from 10 ms slices to 60 s minutes.
Derived from the established HuggingFaceFV/finevideo source.
Limitations
Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.
Description metadata is limited; actual data quality requires manual inspection after download.
Provenance
Source
Derived from HuggingFaceFV/finevideo.
Collection Method
Method of derivation is not specified in the provided description.
Freshness
Last updated 2025-12-29 02:18:25; freshness should be verified.
The full description is hosted externally; users must visit the dataset page for complete details.