LOCUS v1.0 is a chunk-level dataset of U.S. municipal and county law text labeled by legal function. Each eligible chunk is assigned a function, a binary is_substantive label, and all substantive provisions are assigned a topic. The dataset was created by LocalLaws and last updated on HuggingFace in May 2026.
Use Cases
- Legal text research based on chunk-level annotations
- Local-law structure analysis based on functional labels
- Substantive filtering of legal provisions based on the is_substantive binary label
- Downstream taxonomy refinement based on assigned topics for substantive provisions
Strengths
- Chunk-level annotations provide granular labels for legal text analysis
- Includes binary substantive labels and topic assignments for substantive provisions
Limitations
- Column-level documentation is absent; field semantics must be inferred after download
- Row count is unknown, which may limit suitability assessment
Provenance
- Source
- LocalLaws
- Freshness
- Last updated 2026-05-07 02:44:40
- Geography
- U.S. municipal and county jurisdictions