Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
168,883 datasets
Reporte de directivos de entidades públicas vigiladas o supervisadas por la SFC lists executives of public entities supervised by Colombia's Financial Superintendency (SFC). The dataset includes columns for person name, identification number, entity name, and role tenure dates. It was last updated on 2026-05-19 via the datos.gov.co platform.
Derwent Estuary in Australia contains temperature and salinity measurements collected via CTD profiling at a single station. The Australian Ocean Data Network gathered this data between August 2012 and January 2013. The dataset is available in NetCDF and PNG formats.
Station 6 in the Derwent Estuary provides a six-month record of oceanographic conditions. The Australian Ocean Data Network collected temperature and salinity samples via CTD profiles between August 2012 and January 2013. Data is available in NetCDF format, a standard for environmental science.
Beginning in 2019, this dataset contains bi-monthly lobbying activity and expenditure reports filed by public corporations and other principal lobbyists in New York. It is published by data.ny.gov and includes detailed columns on lobbying expenses, targets, and subjects. The data was last updated on 2026-04-17 01:24:04.
Observations from 27th and 28th June 1960 compare magnetic field instruments calibrated at Toolangi with the proton precession magnetograph at the Weapons Research Establishment in Woomera, South Australia. The record is provided by Geoscience Australia Data. The dataset was last updated on 2026-05-31.
Government of Yukon's 1992 report consists of three parts: an overview of mining activity, summaries of government geoscience services, and a compilation of geological research papers. The data is published by the Government of Yukon and includes papers searchable as separate metadata records. The dataset was last updated on the platform in April 2026.
The Government of Yukon's report details the geometry and kinematic history of deformation at the Howard's Pass shale-hosted zinc-lead deposits. It presents lithostratigraphic mapping and structural observations from a study focused on the XY group of deposits, aiming to test a regional thrust duplex model. The report was last updated on April 17, 2026.
Mozilla Common Voice Scripted Speech 25.0 provides audio data for the Chinese (Taiwan) language. This mirror by OpenFormosa embeds the audio into Parquet files at a 48 kHz sampling rate. The dataset was last updated on Hugging Face on 2026-06-06.
Yukon Exploration and Geology 1995 is a government report published by the Government of Yukon. It consists of three parts covering mining and exploration activity, government geological services, and new geoscientific findings. The report is available in HTML and PDF formats and was last updated on the platform in April 2026.
FODESEP, the Colombian Fund for Higher Education Development, publishes this register of information assets to comply with transparency law 1712 of 2014. The dataset includes columns for AREA RESPONSABLE DE LA INFORMACION, NOMBRE SERIE, DESCRIPCION DEL CONTENIDO, and CONSERVACION. It was last updated on 2026-05-18 and is available via the Colombian open data portal.
A geotemporal dataset describes protest-related events in Hong Kong during the 2019-2020 Anti-ELAB protests. The data originates from HKMapLive, a crowdsourced map used by protesters to report events and coordinate movements. It covers two periods: from November 14, 2019 to February 23, 2020, and from June 10 to July 21, 2020.
Wikipedia German — Preprocessed is a cleaned version of the German Wikipedia dump, totaling 13.5 GB. The dataset was processed by raj2708 from a dump dated May 2026, sourced from de.wikipedia.org. It is intended for use in large language model pretraining.
A 1.2 MB PDF document from figshare, last updated May 12, 2026, authored by ETD Depositor. It introduces two Torelli subgroups of the handlebody group and describes their abelian quotients using symplectic representation and Johnson homomorphisms. The work analyzes cup products in the first rational cohomology groups of these subgroups.
Statistics Canada's Consulting Services Price Index (COSPI) tracks price changes for professional consulting services. Quarterly data begins in the first quarter of 2014, with the index rebased to 2018=100. The table presents data for the most recent reference period and the last four periods.
Architectural, engineering and related services price index (AESPI) data categorized by the North American Industry Classification System (NAICS). Statistics Canada publishes this quarterly time series starting from the first quarter of 2013, with an index base period of 2018=100. The table includes data for the most recent reference period and the last four periods.
MTA Long Island Rail Road (LIRR) systemwide operational data for elevators and escalators, beginning in 2019. The dataset tracks the percentage of time these assets are available, sourced from data.ny.gov. It was last updated on May 15, 2026.
Adressen en gebouwen (BAG) is the official registration of all buildings and addresses in the municipality of Eindhoven, Netherlands. The data is maintained by the Ministerie van Binnenlandse Zaken en Koninkrijksrelaties and linked to the national BAG system. It includes identifiers, construction year, building contour, usage area, purpose, coordinates, and address details.
Annual summary data on the number of applicants to the Colombian Navy between 2021 and 2024. The dataset includes aggregated information by year, allowing for observation of trends in interest in joining the naval institution. It is hosted on the Colombian open data portal www.datos.gov.co and was last updated in May 2026.
Public forum content collected from the Looksmaxxing Forum for research and analysis. The dataset preserves discussion structure across 20 forum/subforum records and 172,917 thread-level metadata rows. It was created by trentmkelly and last updated on Hugging Face in May 2026.
Metadata from a manuscript investigating the association between gut microbiota and frailty in older women. The dataset is an 84.4 KB XLSX file authored by Mattias Lorentzon and shared under a CC-BY-4.0 license. It was last updated on June 3, 2026.