Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
14,817 datasets
An organized inventory of information assets held by the University of Cartagena, regardless of physical or electronic format. The dataset is hosted on the Colombian open data portal and was last updated on May 18, 2026. It is designed to identify the information the institution possesses and where it can be consulted.
A synthetic collection of 10,000 skin lesion images generated by Safrina Kabir using a tone-conditioned Generative Adversarial Network (GAN). The images, at 128x128 pixel resolution, are equally divided between medium and dark skin tones and follow the class distribution of the HAM10000 dataset. This dataset, last updated in April 2026, is intended to support fairness-aware experimentation and data augmentation in dermatological machine learning.
A 1961 report on the geology and production of the Golden Peaks open-cut gold mine near Wau, Papua New Guinea. It records 165,224 tons of ore processed, yielding 44,497.98 ounces of gold, and reserves of 245,000 tons at 0.17 ounces per ton. The document, provided by Geoscience Australia, analyzes the orebody composition and discusses hypotheses for its origin.
232.3 MB of satellite imagery samples for automated inland water surface detection in Bangladesh, covering lakes, canals, ponds, and rivers. The dataset was created by Kona Moni and last updated on April 16, 2026. It provides annotations in COCO Segmentation Format and is pre-split into training, validation, and test subsets.
Line-level attestations of the LUNA logogram extracted from Linear B administrative tablets of the Mycenaean period. The dataset is structured for quantitative and network-based analysis of co-occurrence patterns and contextual relationships within the ancient script. Author Irene Orleansky published this resource via Harvard Dataverse in June 2026 for digital humanities and computational linguistics research.
597.2 MB of apple images in which each file name encodes the year of capture and the number of days relative to the optimal harvest time. The dataset was created by Matouš Cejnek and last updated on 2026-05-04. Related source code is available on GitHub.
Data from a single borehole in the Conrad Carlin-type gold deposit in Yukon. The dataset likely contains measurements of total organic carbon (TOC) averaging 1.31 wt. % with a maximum of 3.18 wt. %, and correlations with elements such as Tl and As. It was published by the Government of Yukon and last updated in April 2026.
A statutory infrastructure notice for a telecommunications network project in Brunswick, Victoria, Australia. The dataset includes a contract dated 30/3/2026 and an estimated completion date of 30/6/2026 for an FTTP network. It was published by the SIP Register for Net360 Pty Ltd on 2026-05-15.
A 2001 study presents evidence for a mid-Cretaceous source for southern Australian asphaltites, based on geochemical and carbon isotopic comparisons. The dataset likely contains analytical results comparing asphaltites with onshore source rock analogues from the Albian-Cenomanian Blue Whale Supersequence and the Albian Toolebuc Formation. It was prepared for the Eastern Australian Basins Symposium and is hosted by Geoscience Australia Data.
Polycyclic aromatic hydrocarbon (PAH) data from the McArthur River ore deposit in northern Australia. The dataset likely contains compound distributions and abundances used to infer temperature gradients and ore formation processes. It is provided by Geoscience Australia Data and was last updated in April 2026.
A dataset from figshare by Bingqing Ye, last updated April 2026, containing experimental results on sepsis-induced myocardial injury. The 1.3 MB dataset includes data from rat models and cardiac cell studies, published under a CC-BY-4.0 license. It likely contains measurements from survival analysis, cytokine levels, and apoptosis assays.
Gym locations and services across municipalities in the Risaralda department of Colombia. The dataset includes columns for organization name, municipality, zone, neighborhood, address, operating days, and services offered. It was published on www.datos.gov.co and last updated on 2026-05-18.
Organogram data for the UK's Defence Equipment and Support agency, showing all staff roles. Names and salaries are listed for Senior Civil Servants at the 2* grade and above. The data is published quarterly in validated CSV format under an Open Government Licence, with snapshots taken on 31st March, 30th June, 30th September, and 31st December each year.
100 fragmented LEGO mosaics for reconstructing objects from fragments to graphs. The dataset includes two variants: one with intact fragments and one with degraded fragments featuring edge erosion and missing pieces. It was created by icimathieu and last updated on June 11, 2026.
3.8 MB of supplementary data from a study on GLYATL1 in luminal breast cancer. The dataset contains chromosomal coordinates and statistical metrics for differentially accessible chromatin regions identified via ATAC-sequencing in MCF7 parental and long-term estrogen deprived (LTED) cells. It was authored by Janina Müller and last updated on April 30, 2026.
Supplementary Table 4 from a 2026 study by Janina Müller, published on figshare. The dataset contains DoRothEA scores and p-values predicting transcription factor activities from RNA-sequencing data. It compares MCF7 parental cells, long-term estrogen deprived (LTED) cells, and GLYATL1 knockout clones.
A 19.0 MB dataset from a manuscript on mitigating anaesthetic emissions. It contains raw data and exported spreadsheets for PXRD, FTIR, TGA, N2 adsorption isotherms, simulated gas adsorption isotherms, and breakthrough analysis. The dataset was authored by Ashleigh Chester and last updated on 2026-04-28.
An audio question-answering dataset for clinical and healthcare contexts created by CentificAIResearch. It contains audio recordings paired with clinical questions and answers across multiple QA types, designed to benchmark audio-language models on medical reasoning tasks. The dataset is organized into 7 subfolders.
Over 170 sea surface temperature and chlorophyll-a images from July 2002 to December 2016, with a spatial resolution of approximately 1 kilometer, were used to map the Bonney Coast upwelling. The data was processed by Geoscience Australia using Topographic Position Index and image segmentation techniques to extract upwelling signatures. The results confirm the upwelling is a seasonal system occurring between November and April, with strong inter-annual variation.
4.3 MB of original data from the paper "Reconstitution of mouse spermatogenesis and continuous generation of functional haploid germ cells in testicular organoids". The dataset includes the underlying data for all scatter plots, line graphs, and tables, as well as DEG datasets used for GO term enrichment analysis. It was authored by Fang Luo and last updated on 2026-05-25.