Available on 1 platform
Anadolu OCR Corpus is an OpenCR export of OCR text and metadata for 52 historical PDF sources in Ottoman Turkish, Turkish, and Arabic. The dataset is provided in two Hugging Face configs: 'pages' and 'documents'. It was authored by fatihburakkaragoz and last updated on 2026-05-12.
The license is unknown; verify the terms of use before using the dataset.
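Since the corpus is split into the 'pages' and 'documents' configs, a loader has to name one of them explicitly. The sketch below shows one way to do that with the Hugging Face `datasets` library. The repository id is not given in this listing, so `repo_id` is left as a parameter and the value in the usage note is a hypothetical placeholder. The helper name `load_config` is also an illustration, not part of the dataset.

```python
# Minimal sketch, assuming the corpus is hosted on the Hugging Face Hub
# and that the `datasets` package is installed (pip install datasets).

# The two config names stated in the dataset description.
CONFIGS = ("pages", "documents")


def load_config(repo_id: str, config: str):
    """Load one config of the corpus.

    `repo_id` is an assumption: the listing does not state the Hub
    repository id, so the caller must supply it.
    """
    if config not in CONFIGS:
        raise ValueError(f"unknown config {config!r}; expected one of {CONFIGS}")
    # Imported lazily so the validation above works without the package.
    from datasets import load_dataset

    # `name` selects the config within the repository.
    return load_dataset(repo_id, name=config)
```

A call would look like `load_config("<owner>/<dataset-name>", "pages")`, with the placeholder replaced by the real repository id. Because the license is unknown, check the terms of use before downloading.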