Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
PhoStream contains 5,572 open-ended QA pairs derived from 578 videos across 4 scenarios and 10 capabilities, released by lucky-lance in 2026. This benchmark evaluates omnimodal assistants in mobile-centric streaming environments, focusing on both on-screen and off-screen phone usage. It specifically tests a model's ability to determine both the timing and the content of responses while processing continuous audio-visual streams.
The dataset is licensed under CC BY 4.0. Users should be prepared to handle streaming video and audio data for evaluation.