Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
DocParsingBench contains 1,400 images for document intelligence evaluation across five industry domains including finance, law, and scientific research. Released by SoMarkAI in March 2026, the collection features real-world business documents containing authentic artifacts like scanning noise and seal occlusions.
Licensed under ODC-By; available in ImageFolder format on Hugging Face.