Loading...
Loading...
Legislative text, court decisions, regulatory filings, patents, government contracts, election data
9,662 datasets
A 50-million-token text dataset focused on Indian law. The description indicates it contains chain-of-thought legal reasoning. The dataset is hosted on Kaggle, but details on its author, organization, and specific source are unknown.
A dataset concerning the regulation of tourism and cultural districts. The data likely contains information relevant to balancing cultural identity, visitor management, and economic impact. It was sourced from Kaggle, but specific details on its creation, size, and update frequency are not provided.
preprocessed-data-videos4 is a video dataset from Kaggle. The description indicates it has been preprocessed using a motion-based frame selection method, with a TOP_K parameter set to 16. Further details on the source, size, and specific content are not provided.
Economic Law Semantic Case Matching Dataset is a collection of structured legal cases intended for deep learning semantic mining and similarity analysis. The dataset's author, organization, size, and update date are unknown. It is hosted on Kaggle.
Marriage Guest Dataset is a sample dataset for training machine learning models to predict budgets for a marriage. The dataset is hosted on Kaggle, but its author, organization, and specific creation details are not provided. The number of rows, columns, and the last update date are unknown.
Unit Income Rent data details maximum allowable household income and initial legal versus actual rents for apartments in New York City housing developments receiving city financial assistance. The dataset is reported by building and bedroom size under Local Law 44 of 2012. It is published by the City of New York and was last updated in November 2025.
Data from data.cityofnewyork.us provides records of expenditures made by election campaigns. The dataset includes columns such as COMMITTEE, CANDLAST, AMNT, PURPOSE, and STATE, suggesting detailed tracking of financial transactions. It was last updated on 2025-12-19 15:35:06.
40 state legislative district upper chamber (SLDU) boundaries for Florida, reflecting redistricting data provided to the U.S. Census Bureau by May 31, 2024. The dataset is part of the national TIGER/Line series, designed to stand alone or be combined for national coverage. Boundaries for states like Michigan are not current due to pending legal changes.
Geographic shapefiles for Florida's congressional districts as defined for the 119th Congress, seated from January 2025 to December 2026. The data is an extract from the U.S. Census Bureau's MAF/TIGER System, designed to be a seamless national file. Boundaries reflect state-provided information as of May 31, 2024.
A shapefile containing the geographic boundaries for Illinois's congressional districts for the 119th U.S. Congress, seated from January 2025 through December 2026. The data is part of the U.S. Census Bureau's TIGER/Line series, designed as a standalone extract from the national MAF/TIGER System. Boundaries reflect state-provided information as of May 31, 2024.
TIGER/Line shapefiles provide the geographic boundaries for Louisiana's State Legislative District Upper Chamber (SLDU) as of May 31, 2024. The data is part of the U.S. Census Bureau's national MAF/TIGER System, designed to be a seamless, standalone extract for Louisiana. Boundaries reflect official state submissions, with a unique three-character census code assigned to each district.
A TIGER/Line shapefile from the U.S. Census Bureau containing the geographic boundaries for the single, non-voting delegate congressional district in the District of Columbia for the 119th Congress. The data reflects state-provided boundaries as of May 31, 2024, for the congressional session seated from January 2025 to December 2026.
This repository by vtasca provides scraped text from Federal Open Market Committee (FOMC) meeting statements and minutes, tracking US monetary policy changes through February 2026. The data captures official communications from the Federal Reserve, including both high-level policy summaries and detailed meeting records.
U.S. Census Bureau provides TIGER/Line shapefiles for State Legislative District Upper Chambers (SLDU) across the United States. These boundaries reflect state-submitted information as of May 31, 2024, with updates noted for states like Georgia, Minnesota, and Ohio. The data is designed as a seamless national file with no overlaps or gaps.
Rock-like models containing a single pre-existing flaw provide data for fracture mechanics analysis. The dataset's size, creator, and update date are not specified. It originates from Kaggle under the Engineering and Computer Science domains.
Patent records from China covering a 40-year period from 1985 to 2025. The dataset is hosted on Kaggle, but the specific source, author, and detailed contents are not provided in the metadata. Columns, sample data, and file formats are currently unknown.
Comprising synthetic logical reasoning traces categorized into correct and flawed classes for AI Mathematical Olympiad (AIMO) problems. It provides a collection of step-by-step mathematical proofs designed to help models identify specific points of failure in complex problem-solving.
Law_of_VietNamese is a dataset published on Kaggle. The title suggests it contains legal texts, statutes, or regulations from Vietnam. The dataset's specific content, size, and origin require verification after download.
Voting results by riding from the 2019 Canadian Federal Election. The dataset is hosted on Kaggle and likely contains tabular data on electoral outcomes. The specific columns, size, and license details are unknown.
A dataset for predicting job selection outcomes, likely containing features related to candidates and job roles. It is hosted on the Kaggle platform, but details on its size, origin, and creation date are not provided. The dataset's specific variables and scope must be verified after download.