Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
MedHorizon is a long-context medical video benchmark created by DBD123 and last updated on 2026-05-07. It contains 340 full-procedure clinical videos paired with 1,253 multiple-choice question-answer pairs. The benchmark is designed to evaluate multimodal models on tasks requiring sparse evidence retrieval and multi-hop reasoning across long videos.
License information is unknown; terms of use should be verified before application.