Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
15,883 datasets
Government Digital Service provides data on the performance of the IPO measured against its Ministerial Targets. The dataset originates from the eu_open_data platform. The specific temporal coverage, size, and granularity of the data are not detailed in the available metadata.
An organisational chart detailing the structure of the UK's Technology Strategy Board (TSB). The dataset originates from the Government Digital Service and is hosted on the EU Open Data portal. The specific number of rows, columns, and the last update date are currently unknown.
Organisation Charts showing team structures across the European Union's Investigation and Enforcement Division. The data is provided by the Government Digital Service and is updated on a regular basis. The specific number of charts, their temporal coverage, and detailed column information are not provided in the metadata.
An open-data benchmark for AI-driven screening of inorganic materials. The dataset pairs Raman and X-ray diffraction spectra with material properties. It was created to facilitate machine learning research in materials science.
Kyoei-Sakin-zawa Creek Section in Japan provides a 340-meter-thick Cretaceous stratigraphic record. The dataset contains rhenium-osmium abundance and isotope measurements from organic-bearing siltstone, sandstone, and tuff horizons associated with Oceanic Anoxic Event 1d (OAE1d). Sample collection was funded by an NSF-NERC award in August 2021, and the data is hosted by the British Geological Survey.
Stable oxygen isotope data (δ18O-PO4) measured on the HCl-extractable phosphorus fraction from a 1 cm-resolution sediment core. The British Geological Survey established the core chronology using 210Pb activity analysis and 8 age-depth tie points from a parallel core. This dataset provides a geochemical record from the Rutland Water Nature Reserve in the UK.
Supplementary tables from a 2026 study published in Proceedings of the Royal Society B by Rebecca Young. The tables likely contain data supporting an analysis of shared neural transcriptomic patterns underlying the repeated evolution of mutualistic cleaning behavior in Labridae wrasses. The dataset is published on figshare under a CC-BY-4.0 license.
Source data supports research on responsive interlayer spacing in staggered metal-organic framework nanosheet membranes. The dataset was authored by Xiaoyan Peng and last updated in April 2026. It is provided as a 4.8 MB Excel file under a CC-BY-4.0 license.
A 58.8 KB Excel file supporting research on Aggregation-Induced Emission Luminogens in ternary organic bulk-heterojunctions for solar cells. The dataset, authored by Xiangyu Li, was uploaded to figshare in April 2026. It contains supplementary information for a specific study on improving perovskite-organic tandem solar cell efficiency.
A dataset for training LaTeX OCR models to convert images of mathematical formulas into LaTeX source code. It was created by author harryrobert and last updated on 2026-04-03. The dataset is built with a 3-stage curriculum training pipeline and includes splits for the different training stages.
An image dataset of butterfly species, likely for computer vision classification tasks. The dataset originates from Indonesia, as indicated by the title. The author, organization, and specific collection details are unknown.
A CNN public opinion poll conducted in April 2026. The dataset likely contains survey responses on topics including presidential approval, the economy, midterm elections, and Iran. It was published by CNN via the Roper Harvested Dataverse on May 21, 2026.
Australian government agency codes for managing Crown and Freehold land, compiled by Landgate. The data is part of the Landgate Tenure subscription service and Geospatial Tenure data. It was last updated on March 18, 2026.
NJU-LINK's OmniVideoBench is a large-scale benchmark dataset designed to evaluate multimodal large language models on joint audio and visual reasoning tasks. It addresses a gap in existing benchmarks that often focus on a single modality. The dataset was last updated on April 8, 2026.
The South-east Australian Marine Region, including Lord Howe Island, Tasmania, and the Great Australian Bight, was surveyed in two major campaigns in early 2000. The AUSTREA-1 and AUSTREA-2 surveys were commissioned by the National Oceans Office and Environment Australia to provide scientific information on the seabed. This data was intended to assist in implementing Australia's Ocean Policy and developing marine protected areas.
Survey data from the shallow coastal waters of the Cape York Peninsula, collected by the Australian Geological Survey Organisation in 1992 and 1993. The data covers a 1,000 km section of the inner shelf between Weipa and Cape Flattery, as part of the Cape York Land Use Strategy Project. It summarizes results from marine surveys assessing the natural resources of the region.
Inorganic element data from surface seabed sediments (0-2 cm) in the Timor Sea. The data was collected during the Petrel Sub-basin Marine Environmental Survey GA-0335 (SOL5463) in May 2012 by the RV Solander. This survey was a collaboration between the Australian Institute of Marine Science and Geoscience Australia under the National Low Emission Coal Initiative.
NASA research on the International Space Station (ISS) cultured 3D human neural organoids derived from induced pluripotent stem cells (iPSCs) for one month in low-Earth orbit. Parallel Earth-based controls were used to compare gene expression and histology, revealing accelerated cellular maturation in microgravity. This dataset contains results from dopaminergic organoids, part of a continuing study to understand neurological effects of space travel and treat neurodegenerative diseases.
An 87-class object detection dataset for Minecraft mobs. It contains 27,400 JPEG frames formatted for YOLO. The dataset was uploaded to Kaggle by an unknown author.
Gaojie Ye created a collection of prompt template examples for AI image generation, shared on figshare in April 2026. The dataset is a 9.5 KB Excel file, indicating a small-scale reference set.