Loading...
Loading...
Legislative text, court decisions, regulatory filings, patents, government contracts, election data
9,551 datasets
Isaacus released this corpus of 229,122 Australian legislative and judicial documents in 2026 for legal AI research. It contains over 60 million lines and 1.4 billion tokens sourced from the Commonwealth and various state-level legislative registers.
Canadian government procurement data includes contracts, tenders, awards, and disclosures. The sample contains 5,000 rows of records. Its origin and specific time range are not detailed in the provided metadata.
Development plans of the local municipality of Nickenich establish boundaries for a built-up district in the area of Untere Wiesenstraße, Wiesenstraße and Wiesenpfad. The dataset is provided by the Bundesamt für Kartographie und Geodäsie and was last updated on March 6, 2026.
Interview scripts, photographs, and legal documents support a study exploring enforcement lag in urban e-bike governance. Materials include an interview outline, coded qualitative data, on-site observation photos, and third-party media images. Author PENG, MINGGANG deposited this collection in the Harvard Dataverse, last updated in April 2026.
Supplementary Material 3 from a study on metabolic dysregulation in Gaucher disease. The XLSX file, 80,602 bytes in size, contains data for constraint-based modelling, focusing on mitochondrial dysfunction and cholesterol homeostasis. It is published under a CC-BY-4.0 license.
Supplementary Material 2 provides tabular data supporting a constraint-based model of metabolic dysregulation in Gaucher disease. The dataset focuses on mitochondrial dysfunction and disrupted cholesterol homeostasis, likely containing quantitative measurements for metabolic network analysis. It is shared under a permissive CC-BY-4.0 license.
Resampling procedures to assess variable selection stability with finite sample error control, implemented by Benjamin Hofner. The package implements standard stability selection and complementary pairs stability selection, as described in referenced academic papers. It is designed for use with high-dimensional variable selection procedures such as Lasso or boosting.
robustHD is an R package implementing robust methods for high-dimensional data, authored by Andreas Alfons. The package specifically provides robust least angle regression, robust groupwise least angle regression, and sparse least trimmed squares regression techniques. The methods are based on published research from 2007, 2013, and 2016.
A Bayesian supervised learning approach identifies individual inventors from the U.S. utility patent database from 1975 onward. Ronald Lai of Dana-Farber/Harvard Cancer Center provides descriptive statistics and an interface to calculate patent co-authorship networks without predefined bounds. The data and code are offered for open development by the research community.
The dataset likely contains country-level data used to analyze ratification of the Rome Statute establishing the International Criminal Court. It was created by Terrence L. Chaudoin Chapman for research examining state participation patterns. The analysis contrasts findings with a 2010 article by Simmons and Danner.
Historical CO2 records derived from three ice cores drilled at Law Dome, East Antarctica between 1987 and 1993. The data is presented by David Etheridge of the Commonwealth Scientific and Industrial Research Organisation and is hosted on the CDIAC data transition website. The Law Dome site is described as having high snow accumulation, low impurities, and undisturbed stratigraphy, making it suitable for atmospheric CO2 reconstructions.
Millburn, New Jersey, is the geographic focus of this dataset. The data likely contains information related to a special election and the intensity of public debate surrounding it. It was published on Kaggle, but details about its creation, size, and specific contents are unknown.
A dataset of 50,000 Chinese text samples for intent classification, created by author trytax. The data is synthetically generated and includes labels for intent and domain. It was last updated on March 13, 2026.
Wake County, North Carolina, belongs to the tenth district of the state's Superior Court system. The dataset includes polygons and labels representing subdivisions of this district, which are used for electoral purposes. It is published by Wake County under a CC-BY-4.0 license and was last updated in March 2026.
A dataset of assets that have been or are being retired from the State of Connecticut's open data portal. The data likely includes records for datasets retired due to age, factual inaccuracies, low usage, or replacement by another asset. The dataset was last updated on March 8, 2026.
An R package providing functions for model selection and multimodel inference based on Akaike's information criterion (AIC, AICc) and their quasi-likelihood counterparts (QAIC, QAICc). The package, authored by Marc J. Mazerolle, implements model averaging for parameters or predictions, includes diagnostics for certain model types, and supports Bayesian models from 'bugs', 'rjags', and 'jagsUI' classes. It also allows for model selection using BIC and can format results to LaTeX.
A collection of over 3,600 padel clubs across 10 European countries. The dataset likely contains GPS coordinates, court information, and booking platform details. It was sourced from the Playtomic and Anybuddy platforms and shared on Kaggle.
Washington State's Public Disclosure Commission provides a list of all campaign finance reports, including C3, C4, C5, C6, and LMC forms, filed over the last 10 years. The dataset tracks report amendments and filing history for candidates and political committees, with the reporting period determined by election year or calendar year. It is intended for examining reporting timelines and amendment sequences.
Washington State Public Disclosure Commission data summarizes candidate campaign and political committee financial activity. Records are updated in near real-time, typically less than 2 minutes after campaign submission. The dataset covers the prior 16 years plus the current election year.
Property Legal Descriptions is a dataset of legal descriptions for land parcels in King County, published by data.kingcounty.gov. The dataset includes columns such as Plat Lot / Major, Plat Block / Minor, Legal Description, Account Number, and Parcel Number. It was last updated on March 2, 2026.