Sign in to view source links and access this dataset
Description
Manually collected from the book Fath Al-Kabir Al-Muta‘al fi I‘rab Al-Mu‘allaqat Al-‘Ashr Al-Tiwal, this dataset provides detailed linguistic and semantic annotations for the complete Ten Mu‘allaqat poems. Each entry represents a single verse and includes fields for poet name, verse text, vocabulary explanation, meaning, and grammatical analysis. The dataset was created by SarahALo and last updated on Hugging Face in May 2026 to support Arabic Natural Language Processing and educational applications.
Use Cases
Train grammatical parsers for classical Arabic based on the detailed I'rab (grammatical analysis) annotations.
Develop educational applications for teaching classical Arabic poetry using the vocabulary explanations and meanings provided.
Conduct semantic analysis of pre-Islamic Arabic poetry using the verse-level meaning annotations.
Build named entity recognition or poet attribution models based on the poet name and verse text fields.
Strengths
Dataset contains annotations for the complete set of Ten Mu'allaqat poems, a foundational corpus of pre-Islamic Arabic literature.
Each verse entry includes multiple structured annotation fields: poet name, verse, verse number, vocabulary explanation, meaning, and grammatical analysis.
Limitations
Row count and dataset size are unknown, which may limit suitability assessment for large-scale model training.
Column-level documentation is absent; field semantics must be inferred after download.
Last updated 2026-05-22 19:41:24; freshness should be verified.
Provenance
Source
Manually collected and organized from the book Fath Al-Kabir Al-Muta‘al fi I‘rab Al-Mu‘allaqat Al-‘Ashr Al-Tiwal.
Collection Method
Manual collection and organization.
Freshness
Last updated 2026-05-22 19:41:24.
License information is unknown; users should verify permissions before use.