General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
157,787 datasets
2006 monthly soil erosion rates measured in tonnes per hectare per month across hillslopes in New South Wales. The dataset was produced by the NSW Department of Climate Change, Energy, the Environment and Water and is available in PDF and GEOTIFF formats.
Unbiased Ventures developed a transparent methodology for scoring startup pitch decks. The framework evaluates decks across 8 dimensions with stage-aware weighting and benchmarks against 6,586 companies. The dataset documents this methodology, last updated in June 2026.
Australia's marine geology and phosphorite resources are the focus of this legacy report from the Australian Ocean Data Network. The document likely contains evaluations and recommendations for a national marine geology program. It was last updated on 2026-06-16.
Individual Monthly Hillslope Cover Erosion (t ha⁻¹ month⁻¹) over New South Wales for 2012. The data is provided by the NSW Department of Climate Change, Energy, the Environment and Water and is available in PDF and GEOTIFF formats.
BC Parks and Protected Areas established from 1911 to 2011, including year of establishment and size in hectares. The dataset supports the 'Trends in Ecosystem Protection' indicator published by Environmental Reporting BC. It is provided by the Government of British Columbia.
EPM Aguas Nacionales and the CREG share indices of classified and reserved information as stipulated by Colombia's Transparency and Access to Information Law 1712 of 2014. The datasets likely contain records detailing the legal classification status of public information, including responsible parties, justification, and duration. These indices serve as instruments for public information management.
The BC Address Geocoder is a REST API provided by the Government of British Columbia. It resolves physical locations of addresses and place names in British Columbia to latitude and longitude coordinates. The service also offers address correction, reverse geocoding, intersection location, and parcel identification.
Indicators track the progress of the 2020-2023 indicative plans for management reporting by sectoral entities in the Department of Boyacá, Colombia. The data includes partial progress up to November 2023 and was supplied by the Secretariat of Planning. The dataset was last updated on the platform in May 2026.
From 1911 to 2011, this dataset tracks the area coverage of biogeoclimatic zones within established British Columbia Parks and Protected Areas. It contains the results reported by Environmental Reporting BC in a 2012 indicator summary. The data is provided by the Government of British Columbia.
Historical public data from Deutsche Bahn, the largest train company in Germany. The dataset includes monthly processed files containing train schedules, delays, and cancellations from stations across the country. It is maintained by the author 'piebro' on Hugging Face, with a last recorded update in June 2026.
Recent marine sedimentation on the continental shelf south of Lae, New Guinea is a dataset published by the Australian Ocean Data Network on data.gov.au. The dataset likely contains information about sediment deposits in a specific marine region. No abstract or detailed metadata is available, as it is listed as a legacy product.
Warden-01 is a manually curated dataset of 1,500 penetration testing sessions for training autonomous bug bounty hunting agents. It was created by author yamura4 and last updated on June 18, 2026. The dataset is structured in OpenAI SFT format, containing messages for system, user, assistant, and tool roles.
1505 mapped glacial cirques across the Iberian Peninsula, excluding the Pyrenees. The dataset was created by Ramón Pellitero through manual interpretation in Google Earth and morphometric analysis using the ACME tool. It includes two shapefiles representing cirque surfaces and their lowest closure points.
Sensor-collected sediment data from the Port Curtis Integrated Monitoring Program for Zone 10b in the upper Boyne Estuary. The data collection period spans from December 2006 to June 2014. The dataset is managed by the Australian Ocean Data Network.
Port Curtis Integrated Monitoring Program (PCIMP) data collected by deployed sensors in Zone 06b of the upper Calliope Estuary. The Australian Ocean Data Network hosts this dataset, which covers a time range from December 2006 to June 2014. The specific data format and column details are not provided in the available metadata.
Powidla created a synthetic collection of environmental tabular datasets for machine learning. The data was generated by solving metabolic models for 10,000 pairs of bacteria. It was last updated on June 10, 2026.
Barrios a la Obra program interventions for new roads in the District of Barranquilla are mapped over a four-year period. The dataset includes columns such as UBICACION, BARRIO, and ESTADO to describe the location and status of each project. It is hosted on the Colombian open data portal www.datos.gov.co and was last updated in May 2026.
Marion Plateau, off the coast of northeast Australia, is the focus of this dataset. It likely contains geological records of sea-level changes. The data is a legacy product from the Australian Ocean Data Network, with no abstract available.
2009 Monthly Hillslope Cover Erosion provides monthly soil erosion rates in tonnes per hectare per month across New South Wales for the year 2009. The dataset was published by the NSW Department of Climate Change, Energy, the Environment and Water. Data is available in PDF and GEOTIFF formats under a CC-BY-4.0 license.
Port Curtis Integrated Monitoring Program (PCIMP) data collected by sensors deployed in Zone 01, The Narrows. The Australian Ocean Data Network hosts this sediment monitoring dataset. Data collection occurred from December 2006 to June 2014.