Sign in to view source links and access this dataset
Description
518,255 official Vietnamese legal documents were compiled by author vohuutridung from the thuvienphapluat.vn repository. The collection includes laws, decrees, circulars, and decisions issued by 2,393 unique government bodies. The dataset was last updated on the platform in March 2026.
Use Cases
Classify document_type among 36 categories using text content.
Analyze temporal trends in legal issuance from 1924 to 2026.
Model relationships between issuing_authority and document topics.
Train named entity recognition models to extract legal entities and provisions from document text.
Strengths
Large scale of 518,255 documents provides substantial text data.
Broad temporal coverage spans over a century from 1924 to 2026.
Diverse sources from 2,393 unique issuing authorities.
Limitations
Specific column structure and metadata fields are unknown, limiting structured analysis.
Potential class imbalance across 36 document types is unverified.
Data freshness for recent years (up to 2026) may involve projected or placeholder entries.
Provenance
Source
thuvienphapluat.vn, described as the largest Vietnamese legal document repository.
Collection Method
Sourced and compiled from the online repository by the dataset author.
Time Range
1924 – 2026
Freshness
Last updated on the platform on 2026-03-19.
Geography
Vietnam
License terms are unknown and should be verified before use. The full description with complete metadata is hosted externally on the Hugging Face dataset page.