Legislative text, court decisions, regulatory filings, patents, government contracts, election data
9,670 datasets
Rachel Ellett created this dataset for a book analyzing judicial empowerment pathways in Uganda, Malawi, and Tanzania. The data likely contains records of politically salient court cases identified through interviews, newspaper coverage, and secondary literature. It was last updated on October 20, 2025.
Voter turnout trends in New York City from 2017 onward are reported at citywide, borough, and political district levels. The dataset is maintained by the City of New York and was last updated in December 2025.
700,000 Turkish legal documents from the Yargıtay and Danıştay courts are organized via multiple embedding models and clustering algorithms. These records represent the primary sources of legal precedent in Turkey for civil and criminal cases.
A benchmark of 1,000 to 10,000 records measures safety-utility trade-offs across 12 large language models in the legal domain. Published by marvintong in 2025, the data includes legal questions, multi-phase evaluations, and contract text used to measure model performance and over-refusal tendencies. It is structured into distinct subsets for questions, evaluations, and legal documents.
1,200 multiple-choice questions in Greek are drawn from the Greek Supreme Council for Civil Personnel Selection (ASEP). The dataset was created by ilsp and extracted from official ASEP materials, with a last update recorded on 2025-11-07. Questions cover domains such as constitutional law.
Two data tables contain the percentage of licensed tobacco retailers that sold tobacco to underage youth and young adults. The California Youth Tobacco Purchase Survey (YTPS) assessed sales to youth under 18 from 1997 to 2018; it was succeeded in 2019 by the annual Synar Tobacco Purchase Survey (STPS), which covers young adults under 21. Data collection is mandated by California's STAKE Act and the federal Synar Amendment.
Geoffrey Swenson's Annotation for Transparent Inquiry (ATI) data project accompanies an analysis of U.S. rule-of-law promotion efforts in Afghanistan. The dataset likely contains qualitative data supporting a process-tracing causal inference study of major initiatives from 2002 to 2014. It was harvested by QDR and last updated on October 20, 2025.
300 unique conceptual tags organize excerpts from the constitutions of nearly all independent states that were in force as of December 26, 2017. The dataset was developed by Zachary Elkins for the Comparative Constitutions Project and deposited with QDR. It includes cleaned and tagged texts from in-force constitutions, informed by a public-facing design principle.
Ezequiel Gonzalez Ocantos collected documents for a project analyzing variation in human rights prosecutions in Latin America. The data includes court rulings from Argentina, NGO archives from CELS, and Mexican news articles. The project led to publications in 2014 and 2016.
Lima's street vendor politics are documented in an archive of over 1,000 pages compiled between October 2011 and September 2012. The archive includes newspaper clippings, organizational records, and municipal by-laws from the district of La Victoria, focusing on three municipal administrations from 1992 to 2002. Sally Roever created this collection as part of a research project on the informal economy.
Federal Court Cases originate from 100 court offices throughout the United States, providing an official public record of federal court business. The data collection, updated bi-annually, contains information obtained at case filing and termination points, with the unit of analysis being a single case for appellate and civil data and a single defendant for criminal data.
The Juvenile Court Statistics series, inaugurated in 1926, is the oldest continuous source of information on the processing of delinquent and dependent youth by juvenile courts. It provides annual data on the volume of delinquency, status offense, and dependency cases disposed of by courts with juvenile jurisdiction, distinguishing cases with and without a petition filing. The data includes counts at the state and county levels.
Data.cityofchicago.org provides a list of entities and individuals debarred from doing business with the City of Chicago. Records include debarment date, length, reason, and location details. The list was last updated on September 27, 2025.
A dataset titled 'Indian Legal' was published on the Hugging Face platform by author 'antonhome' and last updated on 2025-12-21. The dataset likely contains text data related to the Indian legal system, such as court documents, case summaries, or statutes. Column names and specific content are unknown and require verification after download.
A calendar of important election dates and deadlines is provided by the Oregon Secretary of State. The dataset is published by data.oregon.gov and was last updated on October 15, 2025. It includes columns such as Date, End Date, Election, Description of Event, and Reference.
Elliot Posner created this dataset to analyze financial regulatory patterns in the United States and European Union, the two jurisdictions most responsible for setting global trends. The data was generated to address a lack of knowledge accumulation in the qualitative literature, particularly around the 2008 international financial crisis. It was last updated on October 20, 2025.
Australian Tax Guidance Retrieval by Isaacus is a legal information retrieval dataset consisting of 112 real-life Australian tax law questions paired with expert-annotated, relevant Australian Government tax guidance and policies. The real-life tax questions are sourced from posts by everyday Australian taxpayers on the ATO Community forum. The dataset was last updated on 2025-10-23.
Department of Justice data provides state and county juvenile court case counts for delinquency, status offense, and dependency cases. The data originates from the National Juvenile Court Data Archive and is supported by the Office of Juvenile Justice and Delinquency Prevention (OJJDP). Row and column counts are unspecified.
An ATOM feed provides access to the original development plan for inner-area statutes in Watenstedt, part of the Samtgemeinde Heeseberg. The data originates from the Bundesamt für Kartographie und Geodäsie and was last updated on November 4, 2025. The feed delivers the plan in its original data format.
FAPO Critic contains constructed benchmark data for training a generative reward model. The dataset was created by author dyyyyyyyy for the FAPO research project and was last updated on the platform in October 2025. It is sourced from ProcessBench and forms the FlawedPositiveBench used to train the FAPO-GenRM-4B model.