Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A corrected version of the Arabic subset from the EXAMS multilingual high-school benchmark addresses widespread text corruption. The dataset repairs issues like split diacritics, fragmented words, and non-Arabic glyphs found in the original PDF extraction. It was created by inceptlabs and last updated on 2026-05-04.
License is unknown; users should verify terms before use.