StreamGaze: Multimodal LLM Benchmark for Gaze-Based Video QA | DataSalon

Neuroscience

Name: StreamGaze: Multimodal LLM Benchmark for Gaze-Based Video QA
Creator: daeunni
Published: 2025-11-21T16:09:58
Keywords: Task Categories: Question Answering, Language: en, Multimodal LLM, Benchmark, Question Answering, Video Benchmark, License: CC BY 4.0, Gaze Tracking, Region: US, Multimodal

Available on 1 platform

Description

StreamGaze is a benchmark dataset for evaluating Multimodal Large Language Models (MLLMs) on gaze-based video question answering. It likely contains fixation metadata drawn from three sources (EGTEA, EgoExoLearn, and HoloAssist) along with QA tasks spanning past, present, and future contexts. The dataset was published by daeunni and last updated on Hugging Face in March 2026.

Use Cases

Benchmarking MLLM performance on gaze-based video QA tasks, as the description presents the dataset.
Evaluating model understanding across temporal contexts (past, present, future), as outlined in the description.
Training models to interpret human fixation patterns in video streams, using the fixation metadata the description mentions.

Strengths

Benchmark is structured for evaluating MLLMs on a specific task: gaze-based QA.
Dataset integrates metadata from three distinct sources: EGTEA, EgoExoLearn, and HoloAssist.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count, file formats, and dataset size are unknown, which may limit suitability assessment.
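
Because field semantics have to be inferred after download, a first pass over the records can at least show which fields exist, what types they hold, and how often they are empty. The following is a minimal sketch of that inspection, not part of the dataset or its tooling. The function name summarize_fields and the sample field names (video_id, question, gaze_xy) are hypothetical and are not taken from StreamGaze itself.

```python
from typing import Any, Iterable, Mapping


def summarize_fields(records: Iterable[Mapping[str, Any]]) -> dict[str, dict[str, Any]]:
    """For each field, list the value types seen, the count of missing values, and one example."""
    rows = list(records)
    # Collect every key that appears in at least one record.
    field_names = sorted({key for row in rows for key in row})
    summary: dict[str, dict[str, Any]] = {}
    for name in field_names:
        # Treat both absent keys and explicit None as missing.
        values = [row[name] for row in rows if row.get(name) is not None]
        summary[name] = {
            "types": sorted({type(value).__name__ for value in values}),
            "missing": len(rows) - len(values),
            "example": values[0] if values else None,
        }
    return summary


# Hypothetical records for illustration only; the real schema is undocumented.
SAMPLE_RECORDS = [
    {"video_id": "clip_001", "question": "What is being looked at?", "gaze_xy": [0.41, 0.58]},
    {"video_id": "clip_002", "question": "What was picked up earlier?", "gaze_xy": None},
]

for field, info in summarize_fields(SAMPLE_RECORDS).items():
    print(field, info)
```

The same function can be applied to any iterable of dict-like rows once the files are downloaded and parsed, whatever their actual format turns out to be.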

Provenance

Source: Hugging Face dataset uploaded by daeunni.
Freshness: Last updated 2026-03-30 04:29:37; verify this against the Hugging Face listing before relying on it.

Quality Score

D37

Community

184 downloads

2 likes

0 views

Dataset Info

Author: daeunni
Created: Nov 21, 2025
Updated: Mar 30, 2026
