Legislative text, court decisions, regulatory filings, patents, government contracts, election data
9,661 datasets
Approximately 8 million U.S. patent grants and applications from 1976 to 2025, cleaned and formatted for language model pre-training. The dataset was created by AllenAI and last updated on December 18, 2025.
Information on broadband connectivity and the Universal Broadband Fund for communities north of the 53rd parallel in Manitoba, Canada. It was created by Shirley Delorme Russell and last updated in February 2026. The number of rows and columns is unknown.
New Debt Issuance for State Authorities contains records of new debt issuances reported by public authorities in New York State. The dataset covers eight fiscal years, including the most recently completed calendar year. It is published by data.ny.gov based on reports mandated by Section 2800 of the Public Authorities Law.
Campaign Contributions data from data.cityofnewyork.us details financial donations to election campaigns. The dataset includes fields for donor, recipient, amount, date, and occupation, and was last updated on December 19, 2025. It is published by the City of New York.
This replication package supports research on the strategic use of job titles to avoid overtime payments. The dataset is authored by Lauren Cohen and hosted by Harvard Dataverse, with a last update in February 2026. The specific row count, column structure, and file formats are not provided.
New Debt Issuance for Industrial Development Agencies contains records of new debt obligations reported by these public authorities in New York State. The dataset covers eight fiscal years, including the most recently completed calendar year. Data.ny.gov publishes this information as mandated by Section 2800 of the Public Authorities Law.
A single governance framework document covering Global AI Sovereignty™. The content outlines the Christine Classy approach to international AI policy and governance, including regulatory requirements related to global AI sovereignty.
Replication materials for a study on the supply and demand determinants of heterogeneous Value-Added Tax pass-through. The data supports the analysis published in the National Tax Journal. Details on row count, column count, and file formats are unavailable.
This dataset supports a study examining how the public assesses claims for relief, compensation, or regulatory accommodation under conditions of structural carbon constraint. The data was authored by Seungwoo Han and last updated in February 2026.
Concordance lines for the word 'constitution' with 50 words of context on either side, extracted from the COFEA database. The data covers the period from 1760 to 1799 and includes scripts used to train an AI model. Row, column, and size information is not available.
This dataset supports replication of a study on ideological asymmetries in trust in elections and non-voting political participation. It contains data related to operational ideology and social science research. The number of rows, columns, and the data structure are not provided.
A collection of 325 HTML games generated using the Gemini 3 AI model and manually curated by the author limanox. The dataset was last updated on January 30, 2026. Its specific content and structure are described as 'very simple'.
A set of between 1,000 and 10,000 text records of raw witness statements and unverified allegations related to the Jeffrey Epstein case, compiled by theelderemo. Updated in late 2025, the data consists of OCR-processed documents sourced from FBI files and investigative journalism. The repository includes graphic material regarding sexual abuse and trafficking.
21 years of national judicial data tracking pending civil and criminal cases in Italy from 2003 to 2024. The dataset provides annual counts of unresolved legal proceedings to measure the workload and efficiency of the Italian justice system.
A dataset from Kaggle listing tourism-related entities in Malawi. The description indicates it includes accommodation, places to visit, and transport information. The author, organization, and specific temporal coverage are unknown.
Federal Reserve FOMC meeting policy statements from 1992 onward. The dataset likely contains the official textual records of Federal Open Market Committee meetings. It is sourced from Kaggle and appears to be auto-updated.
Funded applications for capital grants allocated annually by the New York City Council to not-for-profit organizations. The data details projects involving property acquisition, construction, or equipment purchases that must serve a public purpose. It originates from the City of New York and was last updated in January 2026.
Survey data from the Norsk Gallup Institutt, covering topics such as disarmament plans, liquor sale regulations, election laws, and the effect of taxes on work. The number of rows, columns, and sample data are unavailable.
Statewide Price Agreement Spend - Multi-Year Report summarizes quarterly spending by Oregon state agencies and cooperative procurement participants. Data aggregates vendor-submitted Volume Sales Reports for calendar years 2022 through 2024. The dataset is provided by data.oregon.gov and was last updated in November 2025.
A Kaggle dataset titled 'Korea Presidential Election Data 2025'. The dataset likely contains information related to the 2025 presidential election in South Korea, such as candidate results, voter demographics, or polling data. Its specific contents, size, and origin are not documented.