Loading...
Loading...
Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
15,476 datasets
A unified dataset for weed detection and segmentation in crops, optimized for YOLO models. It combines an original dataset of 9 common weed species from the León region with a portion of the Veridis dataset. The dataset was created by unileon-robotics and was last updated on Hugging Face in May 2026.
Research data on thermogenic molecular hydrogen generation from organic matter in geological systems. The dataset likely contains results from open-system pyrolysis experiments on source rocks from the Songliao, Cooper, Georgina, and other basins, with inferred global in-place accessible H2 estimates of approximately 3.5E+10 tonnes. The data is associated with a 2022 journal article and was published by Geoscience Australia Data.
Replication Data for: Propaganda of Democratic Flaws and Its Impact on Regime Evaluations in Autocracies includes text data from Chinese state-affiliated media Weibo posts from 2013 to 2024 and results from two original survey experiments conducted in China. The data was authored by Rex Weiye Deng and is hosted by The Journal of Politics Dataverse. It was last updated on May 14, 2026.
2015-2030 population estimates for Uganda, providing the total number of people per 3 arc-second grid cell (approximately 100m at the equator). The data is available in GeoTIFF format, with units representing the number of people per pixel. This 2025 Alpha release version was constructed in September 2025 using a Random Forest-based dasymetric redistribution mapping approach.
WorldPop produced a 2025 Alpha release of gridded population estimates for the Democratic People's Republic of Korea. The data provides the total number of people per grid cell for the period 2015-2030, using a Random Forest-based dasymetric redistribution method at a 3 arc-second (approximately 100m) resolution in WGS84 projection. The dataset was constructed in September 2025 and last updated on the platform in March 2026.
2015-2030 population estimates for Laos, with total people per grid cell. The dataset is a 2025 Alpha release from WorldPop, constructed in September 2025, using a Random Forest-based dasymetric redistribution method. Data is provided as GeoTIFF files at a 3 arc-second resolution (approximately 100m at the equator) in the WGS84 coordinate system.
A political science dataset replicating analysis on the Gallagher index, a popular measure of electoral disproportionality. The author, Jack Bailey, demonstrates the index's maximum value is constrained by the number of effective parties and proposes a new normalized measure. The dataset was last updated on 2026-05-19 via Harvard Dataverse.
Democratic Republic of the Congo population estimates per 100m grid cell, produced by WorldPop. The dataset provides total number of people per pixel for the years 2015 to 2030, generated using a Random Forest-based dasymetric redistribution method. It is available in GeoTIFF format with a WGS84 projection.
Registry data covers Canadian charities registered under the Income Tax Act and eligible to issue official donation receipts. Information includes financial details, activities, and directors or similar officials, compiled from annual T3010 returns submitted to the Canada Revenue Agency. The dataset was last refreshed on March 6, 2026.
A 20% random subset of the Food101 dataset, filtered to contain only three food classes: Pizza, Steak, and Sushi. The dataset was created by Shad0wKillar and was last updated on 2026-04-30. It is pre-split into train and test directories, mirroring the original Food101 dataset's 75/25 split ratio.
Open Government Portal Department List contains the registry of departments and organizations registered on Canada's open data portal. The dataset includes bilingual title fields in English and French, along with historical and unified department identifiers. It is maintained by the Treasury Board of Canada Secretariat and was last updated in March 2026.
Global ocean basins contain roughly 1500 historical measurements of radiocarbon (Δ14C) in dissolved inorganic carbon, compiled from published articles. The National Oceanic and Atmospheric Administration digitized samples collected at the surface and various depths between 1955 and 1974. Data includes location, depth, temperature, salinity, Δ14C, δ13C, and dissolved inorganic carbon concentration.
New York City agencies, offices, and organizations with governance functions, including advisory, regulatory, public benefit, and elected bodies. The dataset includes details such as operational status, organization type, and leadership. It is maintained by the NYC Office of Data Analytics and was last updated in April 2026.
Kalynivka village council's organizational structure is documented in this dataset, sourced from the official "Structure and staffing" administrative document. The data originates from the States site of Ukraine and was last updated on 2026-05-06. Its specific focus on a single local government body provides a detailed snapshot of administrative organization.
Offering aggregated statistics on natural hazard events in the Democratic People's Republic of Korea, maintained by the Centre for Research on the Epidemiology of Disasters (CRED). It tracks disaster frequency, human impact, and economic damage categorized by year and specific disaster subtypes. The records are updated through March 2026 to reflect historical and recent hazard trends.
12,000+ individuals are represented in this dataset of over 150,000 images. It contains selfies paired with two official ID photos, such as passports, ID cards, driver's licenses, and residence permits. The dataset was created by AxonData and features balanced demographics across ethnicity, gender, and age.
S-111 is an international hydrographic data standard for encoding surface water current forecasts. This collection contains forecast guidance from NOAA's Operational Forecast Systems and the Global Real-Time Ocean Forecast System (GRTOFS) for U.S. coastal waters and the Great Lakes, encoded as HDF-5 files. Forecasts are generated four times daily for most models and represent currents at a depth of 4.5 meters or half the water column depth.
Seasonal climatologies for the Canadian Pacific Exclusive Economic Zone are derived from a 3-km resolution ocean model simulation spanning 1993 to 2020. The data contain raster layers for 11 variables including temperature, salinity, nitrate, and phytoplankton across 47 vertical levels. This model output, produced by Fisheries and Oceans Canada, represents the climatological state of the region.
1981-2010 seasonal climatologies for the Canadian Pacific Exclusive Economic Zone, derived from a 3-km resolution ocean model simulation. The dataset contains raster layers for 11 variables including temperature, salinity, nitrate, and primary production across 47 vertical levels. Model results have been validated against tide gauge, CTD, and altimetry observations.
5.5 KB of experimental data on the effects of hypobaric hypoxia on rats, authored by Huan Wang and last updated in April 2026. The dataset is available in XLS format under a CC-BY-4.0 license on figshare. It likely contains measurements of body weight, food intake, and hematocrit levels.