Loading...
Loading...
Legislative text, court decisions, regulatory filings, patents, government contracts, election data
9,662 datasets
A dataset of molecular dynamics trajectories for peptides, maintained by the author 'transferable-samplers'. The description indicates a critical update was made on 2025-12-15 to correct 8AA TICA models that originally used an incorrect CA-only atom selection. The dataset is hosted on Hugging Face.
Malawi Dataset is a collection of data from Malawi, published on Kaggle. The dataset likely contains information related to the country's geography, culture, or activities. Its specific content, size, and structure require verification after download.
A capstone project dataset likely designed for simulating financial scenarios and regulatory impacts. The dataset was published on Kaggle, but its specific creation date and temporal coverage are unknown. It appears to be associated with an AI agent named Finpro.
Swiss legal documents, likely indexed for search or analysis. The dataset is hosted on Kaggle, but its specific contents, size, and authorship are unknown.
data.oregon.gov provides the Oregon State Budgeted Revenue Report for the 2025-27 biennium. It details revenue information organized by state agency and fund type, sourced from the Oregon Legislature and the Oregon Judicial Department via the DAS Chief Financial Office's ORBITS system. The dataset was last updated in October 2025.
Municipal grant and reimbursement amounts for programs administered by Connecticut's Office of Policy and Management. Data covers fiscal years from 2010 through 2025 and is published by data.ct.gov. The dataset was last updated in November 2025.
A dataset for evaluating knowledge-related tasks, created by MRMRbenchmark and last updated in December 2025. The evaluation code is implemented based on the MTEB framework and is available in a linked GitHub repository. The dataset description emphasizes strict compliance with copyright and licensing rules from the original data sources.
This dataset documents a deselection list from a library. It was created by St. Mary's Library Test and last updated in February 2026. The specific number of records, columns, and data fields are unknown.
Two distinct data sources comprising historical FiveThirtyEight archives and modern 2024 GitHub mirrors. Files are organized into pre-2023 and post-2024 categories to facilitate longitudinal analysis of statistical journalism.
A Web Map Service (WMS) provides the legal statute for land consolidation in the Bautel area of Leiwen municipality. The Bundesamt für Kartographie und Geodäsie published this geospatial legal document, which was last updated on December 11, 2025. Its content likely defines the boundaries and regulations for land consolidation projects.
WFS INSPIRE special urban planning law RP Tübingen is a Web Feature Service (WFS) providing geospatial data on special urban planning regulations for the Tübingen region in Germany. The service is transformed according to the INSPIRE directive and is based on an XPlanung dataset in version 6.0. It is provided by the Bundesamt für Kartographie und Geodäsie and was last updated on 2025-12-19.
Kenya_elections is a dataset hosted on Kaggle. The dataset likely contains information related to electoral processes in Kenya. Metadata is minimal; specifics regarding columns, size, and provenance are unknown.
Legal LLM Stage 1 HF Datasets likely contains text data intended for training large language models in the legal domain. The dataset is published on Kaggle, but its specific content, size, and authorship are unknown. Its title suggests it may be part of a staged training process for legal AI applications.
Legal-LLM-stage2-processed likely contains text data processed for a second stage of a legal language model project. The dataset is hosted on Kaggle, but its author, organization, and creation date are unknown. Columns and sample data are unavailable, making a detailed assessment impossible.
Project_Sperm_Selection is a dataset hosted on Kaggle. Its specific contents, size, and origin are not detailed in the provided metadata. The title suggests it likely contains data related to sperm cell analysis or selection processes, potentially for use in fertility research or clinical applications.
Sperm selection data, likely related to assisted reproductive technology and fertility research. The dataset is hosted on Kaggle, but its specific size, origin, and collection date are unknown. Columns and sample data are unavailable for review.
Policy violation detection with good precision, according to the raw description. The dataset is hosted on Kaggle and likely contains logs from enterprise systems. Specific details on volume, creation date, and authorship are unavailable.
ULP Legal dataset is hosted on Kaggle. The dataset likely contains legal documents or text. Metadata is minimal; actual content requires verification after download.
ODLC policy documents published on Kaggle. The dataset likely contains official policy texts, though the specific number of documents, their source, and date range are unknown. Metadata is minimal; actual content requires verification after download.
A dataset titled 'vietnamese-legal-rag-v6' is hosted on Kaggle. The title suggests it contains Vietnamese legal text documents intended for use in Retrieval-Augmented Generation (RAG) systems. No further metadata on size, source, or structure is provided.