Loading...
Loading...
Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
14,366 datasets
A dataset from figshare by Špela Janež, last updated May 2026. It contains results from a phenotypic screen of chimeric pattern recognition receptor (PRR) ligands in human peripheral blood mononuclear cells. The data likely includes measurements of cytokine signatures and cytotoxic immune activity for different conjugate combinations.
Organogram data from all UK central government departments and agencies has been published twice yearly since 2010. The data includes staff roles and lists names and salaries for Senior Civil Servants. Snapshots for March 31 and September 30 are validated and released in CSV format under the OGL license.
An inventory of IT hardware and software assets for Colombian government entities, bodies, and organizations. The dataset includes columns for asset confidentiality, value, custodian, criticality, availability, and technical specifications. It was last updated on 2026-05-18 and is hosted by datos.gov.co.
Australian Ocean Data Network provides data on benthic nutrient and gas fluxes, water column, and sediment properties from St. Georges Basin, a coastal lagoon in southeastern Australia. The study investigates how diatoms control nutrient and carbon cycles by coupling benthic and pelagic processes, particularly focusing on nutrient fractionation. Data was last updated on 2026-05-04.
Certified organic farms and businesses physically located in Iowa that have passed inspection from a USDA National Organic Program Accredited Inspection Agency. Columns suggest detailed records for each operation, including Operation ID, name, certifier, certification status, and effective dates for crops, livestock, wild crops, and handling activities. The dataset also includes physical address fields and geographic references like Iowa ZIP Code Tabulation Areas and watersheds.
The BOREAS TGB-03 team collected Dissolved Organic Carbon data during the summer of 1994 in the Northern Study Area. This NASA-led project aimed to establish major sources, sinks, and fluxes of DOC in the boreal ecosystem. Data from environmental samples were intended for combination with hydrologic measurements to calculate carbon budgets.
Peroxide and ozone concentration data were collected to study biogenic hydrocarbon emissions in boreal forest carbon cycles. The dataset includes measurements for hydrogen peroxide, total organic peroxides, ozone, and their deposition velocities. NASA's BOREAS TGB-10 team gathered this data at the SSA Old Jack Pine site during the 1994 growing season.
Simulation results capture six distinct saturation-controlled displacement processes, including primary drainage and subsequent imbibition cycles, for an oil/water system. The 216.0 KB CSV file contains data calculated using a Level Set and Lattice Boltzmann workflow on a 200^3 voxel geometry of Castlegate sandstone. Johan Olav Helland published this dataset on figshare in April 2026.
A 2020 map from the Dutch Ministry of the Interior and Kingdom Relations shows the percentage of people over 65 living in the most paved neighborhoods of Zuid-Holland province. It is intended to help organizations prioritize urban greening efforts to mitigate heat stress among vulnerable elderly residents. The map filters out districts that are less than 60% paved or where less than 10% of residents are over 65.
Bun Chan used high-level quantum-chemistry methods to compute heats of formation for 539 organophosphorus species missing from the NIST Chemistry WebBook. Machine learning was applied to decompose these values into atomic contributions, revealing thermochemical effects of substituents like CF3 groups. The dataset includes revised atomic parameters for economical HOF calculation protocols.
A set of 539 organophosphorus species from the NIST Chemistry WebBook have their heats of formation calculated using high-level quantum-chemistry methods. The dataset, created by Bun Chan and last updated in May 2026, uses machine learning to decompose the heats of formation into atomic contributions. The work also revises optimal atomic values for atomization methods and identifies cost-effective computational protocols.
Performance metrics for a Convolutional Neural Network (CNN) deep learning model. The dataset is a 9.5 KB Excel file published by Jung-Bin Park on figshare. It was last updated on June 4, 2026.
A comparison dataset for computer vision models, specifically ResNet-50 and DRP, analyzing performance across varying numbers of pulses and images. The dataset was authored by Jung-Bin Park and last updated on June 4, 2026. It is a small dataset of 5.5 KB, available in XLS format under a CC-BY-4.0 license.
Major Safety Events is a monthly time series of major safety and security incidents reported by U.S. transit agencies to the Federal Transit Administration's National Transit Database. The dataset covers events from January 2014 onward, published with a typical 90-day lag. The Department of Transportation maintains this record, which includes a transit worker assault flag implemented after a 2023 reporting change.
The dataset supports the preprint "Synaptic and neural pathway redundancy enables the robustness of a sensory-motor reflex and promotes predation escape in C. elegans" by Haoming He et al. It is a 31.3 MB ZIP file containing behavioral data, last updated on 2026-05-04. The associated analysis code is available on GitHub.
100,000 hours of video data collected from a first-person perspective, designed to support advanced computer vision and multimodal AI systems. The dataset was created by InfoBayAI and was last updated on June 3, 2026. It is intended for training models in video understanding and temporal reasoning.
Anonymized records of athletes and para-athletes supported by Indeportes Antioquia across multiple years. The data includes age ranges, gender, sport, and the sporting organization for each individual. The dataset is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
502.2 KB of experimental data from figshare, authored by Norhafizah Binti Sidek and last updated on 2026-04-15. The dataset examines the role of salicylic acid signaling in the phosphate-dependent growth promotion of Arabidopsis thaliana by the root endophyte Colletotrichum tofieldiae. It includes measurements of plant growth, biomass, and elemental nutrient content under low, moderate, and high phosphate conditions.
The Monitoring Matrix for Training Activities from the Integral Monitoring Advisory Office is a tool for planning, executing, and evaluating training programs. It systematically organizes and visualizes all training activities carried out by the office to ensure efficient tracking of their development and results. The dataset is hosted on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18.
David Stadelmann's dataset supports the paper "The beauty premium in politics? Perceptions and political behavior". It includes data and code linking independent beauty ratings of Swiss politicians to their interest group affiliations and alignment with voter preferences. The dataset was last updated on 2026-04-28 and is licensed under CC-BY-4.0.