KIE-HVQA is a dataset that supports research on mitigating optical character recognition (OCR) hallucinations in multimodal large language models. It was created by bytedance-research and is associated with a paper accepted to the NeurIPS 2025 main conference. The data likely consists of multimodal document samples for evaluating and improving OCR integration in vision-language models.
The license is unknown; verify the terms of use before using the dataset.