Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Encompassing legal documents and court opinions parsed from the CourtListener platform by author thoppe, last updated in June 2023. It provides a structured pipeline for ingesting FreeLaw project data into The Pile, a large-scale language modeling corpus.
This repository functions as a data-ready pipeline; users may need to execute the provided parsing scripts to generate the final dataset from the CourtListener source.