Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
15,544 datasets
The constitutive metabolome and lipidome of FL510, a transgressive salinity-tolerant rice genotype, compared with those of its parents IR29 and Pokkali. The dataset includes analytes identified through principal component and partial least squares discriminant analyses, supported by transcriptome data on pathway-related genes. Author Isaiah Catalino Pabuayon published the data on figshare under a CC0 license, with a last update in March 2026.
Natural hazard statistics for the Lao People's Democratic Republic track disaster frequency, human impact, and economic damage in XLSX format. Produced by the Centre for Research on the Epidemiology of Disasters (CRED) and updated through 2026, the data is aggregated by year and disaster subtype.
44,663 traditional Chinese pharmaceutical label documents from Taiwan's Food and Drug Administration (TFDA). The dataset was created by twinkle-ai and last updated on 2026-05-03. Each record contains rendered WebP images of all PDF pages and structured data extracted into a 17-field JSON schema.
Aggregated historical data on natural hazard events in Uganda, compiled by the Centre for Research on the Epidemiology of Disasters. The records quantify disaster frequency, human fatalities, and economic damages categorized by year and specific disaster subtypes.
Aggregated annual statistics on natural hazard events in the Democratic Republic of the Congo, categorized by disaster subtype. Produced by the Centre for Research on the Epidemiology of Disasters (CRED), the data tracks human impact and economic costs through early 2026. Each record summarizes disaster frequency, fatalities, and financial damage for a specific hazard type within a given year.
5,666 scene photographs annotated for mirror surfaces, introduced at CVPR 2020. The dataset includes 5,095 training images with masks and edges, and 571 test images with masks only. It was created by author 'garrying' and last updated on the Hugging Face platform in April 2026.
Liang Liu provides full electrophoresis and other images supporting research on how rRNA intermediates coordinate nucleolar architecture in the model organism C. elegans. The dataset is a 56.2 MB ZIP file published under a CC-BY-4.0 license on figshare. It was last updated on 2026-04-27.
Records detail kilograms of cocaine paste and base seized by Colombian public forces during operations. Data is reported by municipality and department, with entries including the seizure date. The dataset is published by Colombia's national open data portal, www.datos.gov.co, with a last recorded update in March 2026.
A dataset containing 100 episodes of robot action data, totaling 103,706 frames and 300 videos, created using LeRobot. It was uploaded by YOLO2431 to Hugging Face on May 7, 2026. The dataset is structured for a single task involving a Yam bimanual robot.
An automatically annotated dataset for the Corpus Clarification task, introduced in a 2026 paper by Lequeu et al. The dataset transforms noisy, multi-topic citizen contributions from the Grand Débat National into structured data. It was authored by LequeuISIR and last updated on Hugging Face in April 2026.
Information on the organizational structure of the financial department of the Lytyn Village Council, broken down by year. The dataset is provided by the States site of Ukraine and was last updated on 2026-05-06. It is available in common tabular formats such as Excel and CSV.
An English translation of a Chinese corpus for training Socratic teaching models. The dataset was created by ulises-c and last updated on May 4, 2026. It enables English-language research and fine-tuning without requiring access to the original Chinese data.
Preclinical investigations and pilot clinical imaging studies for a series of peptide-derived, c-Met-targeted PET probes labeled with Gallium-68. It reports synthesis details, in vitro/in vivo stability (>90%), and clinical results including tumor-to-lung ratios and correlation with c-Met expression (R=0.71). The dataset is 265.7 KB in size.
The GlaciStore pre-proposal cover sheet, submitted to the International Ocean Discovery Program (IODP) on 31 March 2014. The document, led by Heather Stewart of the British Geological Survey on behalf of a 25-member consortium, outlines scientific objectives and details 12 proposed drill sites for investigating glacial history and basin processes relevant to offshore CO2 storage in the North Sea. The publicly available cover sheet includes an abstract, research objectives, and a table of site coordinates, water depths, and drilling targets.
Replication data for 'How Filibuster Rhetoric Informs Perceptions of Politicians' by Kevin Banda (Legislative Studies Quarterly). The dataset contains results from a preregistered survey experiment and a secondary cross-sectional survey analysis, last updated on May 12, 2026. It examines how elite messaging about the filibuster shapes citizens' ideological and affective evaluations of political figures.
jcabshear created a dataset of manually captioned fantasy character images for training generative AI models. The newest version includes 30 images each for races and classes such as aarakocra, dragonborn, elf, and tiefling. This collection was last updated on 2026-04-24 and is hosted on Hugging Face.
Southern Ocean and Indian Ocean data from the 2012 CLIVAR I09S cruise aboard the Aurora Australis includes discrete and profile measurements of dissolved oxygen, nutrients, salinity, and temperature. The dataset supports the International CLIVAR program's goal of quantifying changes in ocean heat, freshwater, and carbon storage. Measurements were collected using CTD, bottle, and PAR sensor instruments between January 5 and February 12, 2012.
Procrustes distances between pairs of Astroblepus species, with p-values indicating statistically significant differences. The dataset is a small 11.1 KB Excel file authored by Kevin P. Chugá-Puetate and last updated in April 2026. It is openly licensed under CC-BY-4.0.
Aggregate data for building permits issued by the City of Winnipeg since 2010 includes permit activity by building type and community. It tracks residential dwelling units created and lost, along with declared construction values. The dataset is provided by data.winnipeg.ca and was last updated in April 2026.
OCR-recognized text from the street writings of Tsang Tsou Choi (King of Kowloon) in Hong Kong during the 1990s, prior to the 1997 handover. The dataset is a 5.2 KB TXT file, created by QI ZHANG as a course project for the Master of Teaching (LTCL) program at The Education University of Hong Kong. It was last updated on April 17, 2026.