The Pile of Law is a curated corpus of legal and administrative data intended for analyzing data filtering norms and for pretraining legal-domain language models. The dataset is monolingual (English) and originates from US sources. Its size falls in the category of 10 million to 100 million units.
The license is listed as CC BY-NC-SA 4.0 (Creative Commons Attribution-NonCommercial-ShareAlike 4.0), which restricts commercial use.