Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Document Haystack is a benchmark dataset for evaluating multimodal Large Language Models on long-context image and document understanding tasks. It was created by AmazonScience for a 2025 research paper to address the lack of suitable benchmarks for processing long documents. The specific row count, column count, and data size are not provided in the input.
The full dataset description is hosted externally at https://huggingface.co/datasets/AmazonScience/document-haystack. License information is unknown.