Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
166,196 datasets
Global Affairs Canada publishes quarterly reports on the issuance of import permits under the Comprehensive and Progressive Agreement for Trans-Pacific Partnership (CPTPP) for Turkey. The data is updated quarterly, with the last recorded update on 2026-06-10. The dataset is released under the OGL-CA-2.0 license.
121 episodes of robot telemetry data generated using the LeRobot framework. The dataset contains 37,727 frames of video and associated data files, recorded at 30 frames per second. It was created by author Tacoin and last updated on June 17, 2026.
EvoCode-Bench contains 26 executable tasks with 227 total rounds for evaluating coding agents in persistent software engineering interactions. The dataset, created by UnipatAI, uses the Harbor multi-step task format and includes workspaces, task metadata, and verification assets. It was last updated on June 20, 2026.
Data.ny.gov provides data on scholarship awards administered by the New York State Higher Education Services Corporation (HESC). The dataset includes the number of recipients and total dollar amounts by college, beginning with the 2009 academic year. It covers scholarships administered by HESC, organized by TAP college codes and sectors.
The Brownfield Register 2018 from Eastleigh Borough Council contains sites considered potentially suitable for housing or housing-led development. All listed sites have been previously developed upon. The dataset was last updated on 2026-06-19 11:02:13.826742.
A report on ambulance quantities categorized by type across municipalities and territorial entities in Colombia. The data includes columns for basic and medicalized ambulance counts, municipality and department names, a total general figure, a report date, and a source field. The dataset was last updated on 2026-05-18 and is hosted by the Colombian open data portal, www.datos.gov.co.
Initial noise and pollution complaints reported to the Noise and Pollution Control Team of Leicester City Council since January 2018. The dataset is hosted on the uk_data platform and was last updated on 2026-06-17. The specific volume of complaints and detailed column structure are not provided in the available metadata.
A dataset from Leicester City Council, last updated on 2026-06-17, describing IT infrastructure components. It likely contains information on storage, servers, networks, security, backup systems, disaster recovery, business continuity, and end-user devices. The data is provided by a UK local government organization.
Leicester City Council provides data on expenditure for consultancy, professional services, and interim staff. The dataset is available in multiple formats including CSV, JSON, and Parquet. It was last updated on 2026-06-17.
Leicester City Council maintains a register of structures and features considered to have a significant effect on flooding. The dataset likely contains an inventory of engineered assets and natural features influencing flood pathways within the city. It was last updated on 2026-06-17.
National Non-Domestic Rates (NNDR) live accounts data for business rates in the United Kingdom. The dataset is published by Leicester City Council and represents a snapshot at the time of its last publication on 2026-06-17.
Leicester City Council publishes this annual dataset on staff and their time spent on trade union duties. The data covers the 2023/2024 period and is available in multiple formats including CSV, JSON, and Parquet. It was last updated on 2026-06-17.
Leicester City Council provides specific expenditure data for pertinent areas of IT services. The data covers spending from the 2016/17 financial year onward. It is available in multiple machine-readable formats including CSV, JSON, and Parquet.
Staff and their time spent on trade union duties, published by Leicester City Council. This data is published annually, with the latest update recorded in June 2026. The dataset likely contains records of union-related activities and time allocations for public sector employees.
Community Asset Transfers details land and assets transferred to community organizations. The dataset is published by Leicester City Council and was last updated on 2026-06-17 11:33:58.042561.
A dataset from www.datos.gov.co, last updated on 2026-05-18, containing records of user feedback for public services. It likely contains information on Petitions, Complaints, Claims, Suggestions, and Compliments (PQRSF) processed through the User Information and Attention System (SIAU). The data includes columns for AREA, PQRFS type, MOTIVO (reason), PROCEDENCIA (origin), NRO PQRFS (case number), and FECHA (date).
Fortnightly-updated categories of waste that licensed businesses in the Australian Capital Territory are permitted to handle. This dataset details the waste classifications set by the Waste Management and Resource Recovery (Waste Categories) Determination 2024. It is published by the ACT Government Open Data team and was last updated on May 12, 2026.
Surficial cover facies maps for three reefs within the Great Barrier Reef, Queensland. The dataset is published by the Australian Ocean Data Network on data.gov.au and was last updated on 2026-06-27. The specific data format and content require verification after download.
A field excursion guidebook for the Capricorn and Bunker Reefs in the southern Great Barrier Reef. The guide is published by the Australian Ocean Data Network and was last updated on 2026-06-27. The dataset is a legacy product with no abstract available, and its specific content requires verification after download.
MYD03 provides the precise geolocation data for every 1 km sample collected by the MODIS instrument aboard NASA's Aqua satellite daily. Each 5-minute swath file is approximately 30 MB, contributing to a daily volume of about 8 GB. This foundational dataset is produced by the MODIS Science Team and is a critical input for numerous higher-level land and ocean products.