Loading...
Loading...
Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
15,423 datasets
Sparkle is a large-scale dataset containing approximately 140,000 high-quality source-edited video pairs for video background replacement. It was created by Ziyun Zeng, Yiqi Lin, Guoqiang Liang, and Mike Zheng Shou and is hosted on Hugging Face. The dataset is organized into five distinct themes, as detailed in the associated research paper.
Canada Energy Regulator survey data measures participant perspectives on the transparency of adjudication processes following public hearings. The dataset tracks the percentage of surveyed participants who rated transparency as a 4 or 5 on a 5-point scale. Data collection concluded after the 2025-2026 fiscal year.
Nine samples of Cretaceous Toolebuc Formation carbonaceous shales and two coquinite samples were analyzed using high-energy resolution fluorescence detection x-ray absorption spectroscopy. The dataset characterizes uranium oxidation state and distribution, with total organic carbon content ranging from 0.3 to 13.4 wt%. The research was presented at the 2023 Goldschmidt Conference and published by Geoscience Australia.
Three growing seasons of cotton fields in Mississippi from 2021 to 2023 are documented in this dataset. It contains 720 x 720 pixel aerial image tiles with XML annotations for four contamination classes: bags, bottles, cans, and trash. Sean Donohoe created this public domain dataset to support vision systems for detecting field contamination before harvest.
Between February and March 2021, UNHCR and WFP jointly assessed 8,630 refugee households from South Sudan in the Biringi, Bele, and Meri sites of the Democratic Republic of the Congo. The assessment aimed to update knowledge on humanitarian needs to inform program decisions and targeting strategies. This dataset is an anonymized 20% stratified random sample of the original survey data.
Proxima's War.gov UAP Corpus is a provenance-first mirror and source index for files listed in the U.S. Department of War's UAP/UFO release portal. It aims to preserve the official release listing, public source URLs, verified file hashes, and byte sizes before any processing. The dataset was last updated on 2026-05-09.
A genomic dataset containing the assembled and annotated organelle genome for an organism referred to as 'Cu'. The dataset was authored by 振宇 Lyu and was last updated on April 30, 2026. It is available under a CC-BY-4.0 license and consists of PDF and TXT files totaling 5.7 MB.
Southern Australian margin asphaltites are compared geochemically and isotopically to onshore source rock analogues from the Albian-Cenomanian Blue Whale Supersequence and the Albian Toolebuc Formation. The dataset includes evidence from carbon isotopes, C30 methylsteranes, C26 norsteranes, and metalloporphyrins, supporting a mid-Cretaceous source hypothesis. The Australian Ocean Data Network published this analysis for the Eastern Australian Basins Symposium in 2001.
Geochemical and carbon isotopic evidence compares asphaltites from the southern Australian margin with Albian-Cenomanian source rock analogues. The analysis includes biomarkers like C30 methylsteranes, C26 norsteranes, and metalloporphyrins, supporting a genetic link to the mid-Cretaceous Toolebuc Formation. This research was presented at the Eastern Australian Basins Symposium in 2001.
Offering standardized humanitarian organization data sourced from the HDX Humanitarian API (HAPI) as of March 2026. It contains operational partner information formatted with Humanitarian Exchange Language (HXL) tags to enable interoperability across humanitarian response systems.
Experimental data on phosphorus fractions in maize rhizosphere soil from a study on the effects of exogenous PHY and PSB additions with or without organic phosphorus application in low-phosphorus red soil. The dataset is a 9.5 KB XLS file uploaded by Long Zhou to figshare and last updated on May 4, 2026. It is licensed under CC-BY-4.0.
A 5.5 KB XLS file published by Long Zhou on figshare in May 2026. The dataset likely contains measurements of maize root morphological traits under different phosphorus treatments, including the exogenous addition of PHY and PSB with or without organic phosphorus application.
Sema İncekara authored this dataset on measures to increase organizational commitment. It is a 5.5 KB Excel file, last updated on May 4, 2026, and is licensed under CC-BY-4.0.
A 5.5 KB Excel file uploaded to figshare by Sema İncekara on May 4, 2026. The dataset examines factors that negatively impact teachers' commitment to their organizations. Its specific contents and scale are not detailed in the available metadata.
A 5.5 KB Excel file uploaded to figshare by Sema İncekara, last updated on May 4, 2026. The dataset likely contains tabular data analyzing factors that increase organizational commitment among teachers. The specific variables, sample size, and data collection method are not detailed in the provided metadata.
Sema İncekara published a 5.5 KB Excel file containing a correlation matrix between organizational happiness and commitment on figshare. The dataset is licensed under CC-BY-4.0 and was last updated on May 4, 2026. Column names and row counts are not specified in the available metadata.
291 historical polities form a panel dataset used to analyze the relationship between meritocracy and autocracy. The data supports a paper by Clair Yang proposing meritocracy as a power-sharing institution in authoritarian regimes. It was last updated on May 14, 2026, in The Journal of Politics Dataverse.
A geological dataset from the Australian Ocean Data Network describes magnetite concentrations in beach sands on Bougainville Island. The magnetite, combined with titanium dioxide, is derived from recent andesitic volcanoes including Mt. Balbi, Mt. Bagana, and Mt. Taroka. The description suggests magnetic survey methods may help delineate zones of magnetite concentration in coastal plains.
Survey data from 570 Chinese university students explores socio-ecological correlates of exercise procrastination and exercise addiction. It includes assessments using the Procrastination in Exercise Scale (PES) and Revised Exercise Addiction Inventory (EAI-R), with candidate predictors from individual characteristics, health behaviors, and interpersonal networks. The analysis applied LASSO regression with cross-validation and sensitivity checks.
Australian Ocean Data Network provides a study on organic geochemistry in sedimentary rocks. The research uses biological marker distributions to infer the source, depositional environment, biodegradation, and maturity of organic matter. The dataset was last updated on 2026-04-10.