Multi Legal Pile contains legal documents from the European Union, covering 24 official languages. The dataset was created by joelniklaus and was last updated in January 2024. It serves as a resource for multilingual legal text analysis.
Use Cases
- Train a multilingual language model on legal text from 24 EU languages.
- Benchmark cross-lingual information retrieval systems using legal document content.
- Analyze linguistic patterns and terminology across different EU legal systems.
- Fine-tune a legal document classification model for multiple languages.
Strengths
- Covers 24 official languages of the European Union.
- Dataset was updated in January 2024.
Limitations
- The total number of documents, rows, and file size are unknown.
- Specific document sources, collection methods, and license information are not provided.
Provenance
- Source
- null
- Collection Method
- null
- Time Range
- null
- Freshness
- Last updated 2024-01-12.
- Geography
- European Union