Sign in to view source links and access this dataset
Description
300,929 parallel translation pairs extracted from Maldivian legal documents, court cases, and legislation. The dataset was created by alakxender and last updated on April 17, 2026. It includes text from court judgments, acts, regulations, the Constitution, and presidential decrees that has been cleaned and deduplicated.
Use Cases
Train legal-domain machine translation models based on the high-quality parallel corpus.
Fine-tune large language models for Dhivehi-English legal text generation.
Benchmark translation quality for low-resource language pairs in a specialized domain.
Analyze legal terminology and phrasing patterns across Maldivian legislation.
Strengths
Contains 300,929 translation pairs, providing substantial volume for model training.
Focuses on a specialized, high-stakes domain (legal texts) which is often underrepresented.
Text has been cleaned of markdown formatting and deduplicated, suggesting some quality control.
Limitations
Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.
The description metadata is limited; actual data quality requires manual inspection after download.
Provenance
Source
Maldivian legal documents, court cases, and legislation.
Collection Method
Extracted and processed to create a parallel corpus; text cleaned and deduplicated.
Time Range
null
Freshness
Last updated 2026-04-17 16:21:15; freshness should be verified.
Geography
Maldives
License is unknown; users must verify permissions before use.