Loading...
Loading...
Legislative text, court decisions, regulatory filings, patents, government contracts, election data
9,453 datasets
WildClawBench is a benchmark containing 60 original tasks for evaluating AI agents within a live OpenClaw environment. It tests agents on end-to-end, practical work such as clipping football highlights and negotiating meeting times. The benchmark is multimodal, supporting languages including English and Chinese, and was created by internlm.
50,000 synthetic candidate profiles intended for recruitment and machine learning research. The dataset was sourced from Kaggle, but its author, creation date, and specific contents are not detailed. Its primary purpose is to support the development of automated resume screening and candidate selection algorithms.
Monthly newsletter from the Directorate of Nuclear Substance Regulation (DNSR) sent to all licensees. It contains articles on regulatory requirement updates, reportable event trending, compliance guidance, and lessons learned.
Synthetic multi-agent workflow traces with LLM-enriched content for the legal-document-analysis domain. The dataset contains 1,498 events across 50 workflow runs, each representing a complete multi-agent execution trace. It was created by juliensimon and last updated on March 29, 2026.
516 maps cover the entire continent of Australia at a scale where 1cm represents 2.5km. Each standard map depicts an area of approximately 1.5 degrees longitude by 1 degree latitude, showing natural and constructed features. The series is produced by Geoscience Australia.
Temperature-depth profiles were collected in the North Atlantic Ocean using expendable bathythermograph (XBT) instruments from NOAA Ship Delaware II. The data, processed by the National Oceanographic Data Center (NODC) into the C116 format, covers a specific cruise from January 20 to 25, 1971. This dataset captures temperature values at non-uniform depths, recorded at inflection points to define the ocean's thermal structure.
NODC-processed bathythermograph data captures ocean temperature-depth profiles from the Cape Roger vessel in the Gulf of St. Lawrence and North Atlantic Ocean. The dataset uses the C116/C118 format, recording pairs of temperature and depth values at non-uniform inflection points to define the thermal curve. It represents a specific nine-day mission from March 17 to March 25, 1988, archived by NOAA NCEI.
Bathythermograph (XBT) data from a NOAA research cruise in the North Atlantic Ocean. The dataset contains temperature-depth profile pairs recorded at non-uniform inflection points to define the ocean's thermal curve. It was processed by the National Oceanographic Data Center (NODC) into the standard C116 format.
A report details judicial processes subject to control by the Municipal Comptroller's Office of Itagüí, Colombia. The data provides a snapshot as of March 5, 2021, sourced from the municipal administration. It was published by the Colombian open data portal datos.gov.co.
India Legal Document Simplifier Dataset likely contains legal texts from India intended for simplification tasks. Published on Kaggle, its specific contents, size, and creation details require verification after download. The dataset's author, organization, and last update date are unknown.
NOAA NCEI provides ocean station and selected depth bathythermograph data collected from vessels like CARIBIA EXPRESS and USCGC Chase. The dataset spans from September 1968 to December 1984, covering the Caribbean Sea and North Atlantic Ocean. Observations were processed by the National Oceanographic Data Center into standard C100 and C125 formats, containing physical-chemical measurements at discrete depth levels.
New York City agency spending and budget data reported in the Preliminary Mayor's Management Report (PMMR). The dataset includes financial plan and expenditure figures for city agencies, compiled by the Office of Management and Budget. The data was last updated in March 2026.
A dataset of 68 days of bathythermograph (XBT) temperature-depth profiles collected by NOAA Ship Delaware II in the North Atlantic Ocean. The data was processed by the National Oceanographic Data Center (NODC) into the standard C116 format, which records temperature at non-uniform inflection points to define the profile curve. This historical dataset is part of NCEI Accession 8100062.
Environment Agency data details actions taken to achieve water body objectives under the Water Framework Directive. The dataset supports river basin management plans for the Anglian groundwater catchment, predicting status improvements by 2021. It is published by the UK Government Digital Service and available for interactive viewing.
139 agent tasks across general and multimodal splits evaluate real-world AI agent performance. The benchmark covers 24 categories including communication, finance, and operations, created by claw-eval. It was last updated in March 2026.
2.2 million decision states were collected from real Orbit Wars games played by high-skill players with an Elo rating of 1650 or above. The data includes both value and policy information, likely representing game states and player actions. The dataset was uploaded to Kaggle, but the author, organization, and specific collection details are unknown.
A dataset for fine-tuning models for translations between English and Tumbuka, a language spoken in Malawi. The dataset's author, organization, size, and specific contents are unknown. It was sourced from the Kaggle platform.
Legal_text_simplification is a dataset hosted on Kaggle. The title suggests it contains legal documents or texts processed for simplification. Metadata is minimal; actual content requires verification after download.
A Web Map Service (WMS) layer provides the development plan for the Volkmarsdorf Süd area of the Velpke municipality. The data is encoded in the XPlanGML Version 5.4 format, a standard for urban planning information in Germany. The dataset is provided by the Bundesamt für Kartographie und Geodäsie and was last updated on April 1, 2026.
London's Central Activities Zone boundary as proposed in the 2009 London Plan consultation. The data is provided by the Greater London Authority for illustrative use only, representing a non-finalized planning area. It was last updated in the platform on 2026-03-25.