Available on 1 platform
A curated dataset for binary text classification: distinguishing Moroccan Darija (written in Arabic script or Arabizi) from other Arabic dialects, Modern Standard Arabic, and other languages such as English. The dataset was built from multiple public sources, then cleaned and taxonomized by the author, atlasia. It was last updated on 2026-06-07.
License is unknown; check the dataset page for terms before use.