Sign in to view source links and access this dataset
Description
Iranian Legal Question Answering Dataset (Farsi) includes over 600,000 questions and 2 million answers in written form. The questions were posed by ordinary Persian speakers, while the responses were provided by attorneys from various specialties. The dataset is maintained by PerSets and was last updated on May 18, 2025.
Use Cases
Train legal question-answering models based on the described question-answer pairs.
Analyze patterns in public legal inquiries in Iran based on the described user-generated questions.
Benchmark Persian language models on domain-specific tasks based on the described legal text corpus.
Study attorney response styles and legal reasoning in Farsi based on the described answer content.
Strengths
Contains over 600,000 user-submitted legal questions.
Includes over 2 million corresponding answers provided by attorneys.
Questions and answers are sourced from a specific legal platform (dadrah.ir).
Limitations
Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.
Description metadata is limited; actual data quality requires manual inspection after download.
Provenance
Source
dadrah.ir
Collection Method
Questions gathered from ordinary Persian speakers, with answers provided by attorneys.
Freshness
Last updated 2025-05-18 20:39:32; freshness should be verified.
Geography
Iran
License is unknown; terms of use must be verified before application.