JurisTCU is a Brazilian Portuguese legal information retrieval resource built from the curated jurisprudence collection of the Brazilian Federal Court of Accounts (TCU). The original dataset contains 16,045 jurisprudence documents, each organized into more than 20 fields, including a summary (ENUNCIADO) and a decision excerpt (EXCERTO). It was created by ufca-llms and last updated on 2026-04-10.
Use Cases
- Train legal document retrieval models based on the summary (ENUNCIADO) and decision excerpt (EXCERTO) fields.
- Benchmark Portuguese language models on legal text understanding tasks.
- Analyze patterns in Brazilian federal administrative jurisprudence.
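As a rough illustration of the retrieval use case, the sketch below builds a small TF-IDF index over the ENUNCIADO and EXCERTO fields and ranks documents by cosine similarity. It uses only the Python standard library. The three records are invented placeholders, not real JurisTCU documents, and the helper names (`tokenize`, `build_index`, `search`) are hypothetical; the dataset itself does not prescribe any retrieval method.

```python
import math
import re
from collections import Counter

# Invented placeholder records (NOT real JurisTCU documents).
# Only the field names ENUNCIADO and EXCERTO come from the dataset description.
docs = [
    {"ENUNCIADO": "licitação dispensa contratação emergencial",
     "EXCERTO": "a contratação emergencial exige justificativa"},
    {"ENUNCIADO": "aposentadoria servidor tempo de contribuição",
     "EXCERTO": "o tempo de contribuição deve ser comprovado"},
    {"ENUNCIADO": "convênio prestação de contas omissão",
     "EXCERTO": "a omissão na prestação de contas gera débito"},
]


def tokenize(text):
    """Lowercase and split on word characters (Unicode-aware in Python 3)."""
    return re.findall(r"\w+", text.lower())


def build_index(records, fields=("ENUNCIADO", "EXCERTO")):
    """Return (idf, vectors): smoothed IDF weights and one TF-IDF dict per record."""
    token_lists = [tokenize(" ".join(r[f] for f in fields)) for r in records]
    doc_freq = Counter(t for toks in token_lists for t in set(toks))
    n = len(records)
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in doc_freq.items()}
    vectors = []
    for toks in token_lists:
        tf = Counter(toks)
        vectors.append({t: count * idf[t] for t, count in tf.items()})
    return idf, vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search(query, idf, vectors, k=2):
    """Return the top-k (score, record_index) pairs for a query string."""
    tf = Counter(tokenize(query))
    # Query terms that never occur in the indexed records are ignored.
    q = {t: count * idf[t] for t, count in tf.items() if t in idf}
    scored = [(cosine(q, v), i) for i, v in enumerate(vectors)]
    return sorted(scored, reverse=True)[:k]


idf, vectors = build_index(docs)
results = search("prestação de contas", idf, vectors)
for score, idx in results:
    print(round(score, 3), docs[idx]["ENUNCIADO"])
```

A lexical baseline like this is mainly useful as a reference point: a trained retrieval model or a Portuguese language model being benchmarked can be compared against it on the same queries.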
Strengths
- Contains 16,045 legal documents from an authoritative source, the Brazilian Federal Court of Accounts.
- Documents are structured with more than 20 metadata and textual fields, including key textual summaries and excerpts.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- The row count of the benchmark subset is not documented, which makes it hard to judge whether the subset is large enough for a given task.
- Last updated 2026-04-10; check whether a newer version exists before relying on it.
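Because column-level documentation is absent, a quick per-field profile after download can help with inferring what each field holds. The following is a minimal sketch that assumes the records have already been loaded as a list of dictionaries. The `profile_fields` helper, the sample rows, and the `EXAMPLE_YEAR` field are hypothetical and only illustrate the idea; they do not reflect the actual schema.

```python
from collections import defaultdict


def profile_fields(records):
    """Summarize each field: share of non-empty values, value types, mean text length."""
    stats = defaultdict(lambda: {"non_empty": 0, "types": set(), "total_len": 0})
    for rec in records:
        for name, value in rec.items():
            entry = stats[name]  # registers the field even if this value is empty
            if value is None or value == "":
                continue
            entry["non_empty"] += 1
            entry["types"].add(type(value).__name__)
            entry["total_len"] += len(str(value))
    n = len(records)
    report = {}
    for name, entry in stats.items():
        filled = entry["non_empty"]
        report[name] = {
            "fill_rate": filled / n if n else 0.0,
            "types": sorted(entry["types"]),
            "avg_len": entry["total_len"] / filled if filled else 0.0,
        }
    return report


# Hypothetical sample rows; EXAMPLE_YEAR is an invented field name.
sample = [
    {"ENUNCIADO": "texto a", "EXCERTO": "", "EXAMPLE_YEAR": 2020},
    {"ENUNCIADO": "texto bb", "EXCERTO": "trecho", "EXAMPLE_YEAR": None},
]

report = profile_fields(sample)
print("rows:", len(sample))
for field, summary in report.items():
    print(field, summary)
```

Fields with a high fill rate and long average length are likely free text, while short, mostly uniform values tend to be identifiers, dates, or categories. Printing `len(records)` on the real data also settles the undocumented row count.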
Provenance
- Source: Brazilian Federal Court of Accounts (TCU) jurisprudence collection.
- Collection Method: Curated and derived from the original collection to create a benchmark subset.
- Freshness: 2026-04-10
- Geography: Brazil