Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
168,672 datasets
Antarctic marine geoscience data collected under a program by the Bureau of Mineral Resources (BMR). The dataset is published by the Australian Ocean Data Network on data_gov_au. It was last updated on 2026-06-16.
The formation of the Great Barrier Reef is a dataset from the Australian Ocean Data Network. It is published on data_gov_au and was last updated on 2026-06-16. The dataset likely contains information related to the geological and biological history of the reef.
Australian Ocean Data Network provides a legacy dataset on shallow-marine environments mapped via remote-sensing techniques. The dataset is published on data_gov_au and was last updated on 2026-06-16. Metadata is minimal, with no abstract or column details available.
A 2022 modern German Bible translation, explicitly published into the public domain. The dataset contains 61 of the 66 canonical biblical books, aligned with OSIS verse IDs. It was uploaded by the author 'Geliebter' to Hugging Face and last updated on June 14, 2026.
A benchmark dataset capturing test results for agentic coding tasks performed by local LLMs on consumer hardware. The dataset likely contains performance metrics from tests run on an NVIDIA RTX 4060 Ti 8GB GPU and Intel i7-14700F CPU system. It was created by witcheer and uploaded to Hugging Face in May 2026.
Tables of genetic variants identified in OMM12 species and Escherichia coli isolates. The data originates from bacterial strains re-isolated from long-term-colonized gnotobiotic mice. The dataset is 426.8 KB in size, authored by AGSTECHER, and was last updated on May 26, 2026.
Major incidents on the Metro-North Railroad are defined as events causing ten or more trains to be delayed over 5 minutes, canceled, or terminated. The dataset, sourced from data.ny.gov, includes counts of affected trains segmented by peak and off-peak periods. It was last updated on May 15, 2026.
BMR Geoscience Research Cruise 95 dataset concerns Triassic and Jurassic sequences of the Northern Exmouth Plateau and Offshore Canning Basin. It is a legacy product published by the Australian Ocean Data Network on data_gov_au. The record was last updated on 2026-06-16.
Legacy product from the Australian Ocean Data Network focusing on seabed geology and resource potential. The dataset covers three major submarine features in the southwest Pacific Ocean: the Macquarie Ridge, Norfolk Ridge, and Lord Howe Rise. Its content is available in HTML and PDF formats, but no abstract or detailed metadata is provided.
A dataset from the Deep Sea Drilling Project's Leg 28, focusing on diachronous biogenic facies and their palaeoclimatic significance. The data is hosted by the Australian Ocean Data Network and was last updated on 2026-06-16. Legacy product metadata is minimal, with no abstract available.
Corrections for Marine Heat Flow Measurements is a dataset published by the Australian Ocean Data Network on data.gov.au. The dataset likely contains correction factors or adjustments for marine heat flow measurements. Its last update was recorded on 2026-06-16.
GENEB is a multi-task benchmark for DNA sequence encoders. The dataset provides task-level sequence classification data for evaluating precomputed genomic embeddings. It was authored by 'darlednik' and is scheduled for a full release update on June 8, 2026.
1,507 human annotations from a study titled 'When does autoresearch need a human?'. ProlificAI collected these evaluations from 300 participants assessing models generated by Karpathy's autoresearch on a DPO task. The dataset includes per-pair statistics, Bradley-Terry rankings, and LLM-clustered comment themes.
2020-2021 survey records characterizing the homeless population in the municipality of Palmira, Colombia. The dataset includes demographic, health, and social service variables, sourced from the Colombian open data portal datos.gov.co. It was last updated on the platform in May 2026.
Shallow reef structure data from the southern Great Barrier Reef, published via the Australian Ocean Data Network. The dataset is listed as a legacy product with no abstract available. It was last updated on 2026-06-16 21:03:30.607448.
Palynology data from marine Lower Cretaceous strata in the northern and eastern Eromanga Basin, Queensland. The dataset is published by the Australian Ocean Data Network on data_gov_au. It was last updated on 2026-06-16.
Aguas de Palmira's 2022 registry catalogs the company's information assets. The dataset, published on datos.gov.co, includes columns describing the storage medium, publication status, format, and content description for each asset. Its last update was recorded on 2026-05-18.
WMS service provides geometries for valid 5-digit postal code delivery areas across the Federal Republic of Germany. The data originates from Deutsche Post Direkt GmbH and is maintained by the Bundesamt fΓΌr Kartographie und GeodΓ€sie. Postal code boundaries are not always congruent with administrative territorial units.
Marine heatflow system data from the Australian Ocean Data Network. The dataset likely contains information on instrumentation and techniques used for measuring heat flow in marine environments. Metadata is minimal, with the last update recorded as 2026-06-16.
Gideon Abegunrin collected this research data through a systematic search. The dataset is published on figshare under a CC-BY-4.0 license and was last updated in May 2026. Its total size is 913 bytes.