Loading...
Loading...
Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
15,402 datasets
Four ATom (Atmospheric Tomography) campaigns collected high-frequency measurements of volatile organic compounds (VOCs) using the Trace Organic Gas Analyzer (TOGA). The dataset includes radical precursors, tracers of anthropogenic and biogenic activities, and compounds relevant to aerosol formation and atmospheric processing. Data were produced by ORNL_CLOUD, with the most recent metadata update noted as March 2026.
5.5 KB of tabular results from an ablation study evaluating the MS-YOLOv11 object detection model on a rooftop photovoltaic dataset. The data is stored in an XLS file and was authored by Jiajun Zhu, last updated on May 6, 2026. The dataset is licensed under CC-BY-4.0.
5.5 KB of tabular data contains oral bioavailability predictions for selected compounds based on Veber and Egan rules, as calculated by the SwissADME server. The dataset was authored by Courage Siame and uploaded to figshare, with a last update timestamp of 2026-05-06. It is available under a CC-BY-4.0 license.
Binding energy (ฮG, kcal/mol) and inhibition constant (Ki, ฮผM) values for organosulfur compounds derived from Allium sativum L. (garlic). Courage Siame uploaded this dataset to figshare in May 2026. The data is stored in a 9.5 KB XLS file.
106,977 grayscale image crops packaged for training classical face detectors like Viola-Jones and Haar cascades. The dataset includes a large pool of natural-image negatives from Caltech-256 and a specific CBCL benchmark split for testing. It was uploaded by user salvacarrion to Hugging Face and last updated on 2026-05-14.
A 1976 research cruise by HMAS Diamantina collected about 2000 manganese nodules from the Indian Ocean floor southwest of Cape Leeuwin, Western Australia. The survey targeted a deposit initially discovered in 1970 and estimated to cover approximately 900,000 square kilometers. The work was conducted by researchers associated with the Australian Ocean Data Network.
Evidence of aeolian deposition was identified in the Jurassic Jurgurra Sandstone during a 1976 geological mapping survey in the Canning Basin. The dataset consists of a published note presenting an environmental analysis of a specific 5-meter-thick creek exposure. It was published by the Australian Ocean Data Network and last updated in April 2026.
A 2026 stratigraphic interpretation correlates Cretaceous sedimentary rocks from Valanginian to Campanian age across onshore and offshore wells in Western and South Australia. The Australian Ocean Data Network compiled biostratigraphic and seismic data, identifying supersequences and organic-rich facies. The succession reaches over 357 meters thick in the Madura 1 well.
Government of Ontario data tracks youth who re-offend within two years of completing a community disposition or custody sentence. The dataset is organized by fiscal year of completion, region, sentence type, and re-offend status, providing total counts for tracked individuals. It is not representative of all youth served by Youth Justice Services.
Processed keypoints from the ASL Citizen dataset, intended for isolated sign recognition and encoder pretraining. The dataset was created by SharoonArshad and last updated on May 17, 2026. It is designed as the first stage of a larger pipeline for translating continuous American Sign Language to English.
A geological study uses eustatic sea-level changes to correlate Late Permian coal-bearing sequences across the Sydney, Gunnedah, and Bowen Basins in eastern Australia. The analysis identifies specific marine formations deposited during the same highstand, such as the Dempsey Formation and Black Alley Shale. This work was published by the Australian Ocean Data Network and last updated in April 2026.
The R.V. Valdivia survey in 1977 collected 1700 km of 24-channel seismic data and 2550 km of bathymetric, gravity, and magnetic data, along with 31 bottom samples. The Australian Ocean Data Network hosts this dataset, which was last updated in April 2026. It provides information on the geological structure and sediment history of the region between northwest Australia and the Java Trench.
Data collection for NASA PACE validation efforts, targeting phytoplankton carbon biomass, particulate organic carbon concentration, net primary production, chlorophyll fluorescence quantum yields, and inherent optical properties. The dataset is maintained by the OB_DAAC organization and was last updated in March 2026. It is available on multiple government platforms, indicating its importance for satellite calibration.
Fossil specimens from the Kimmeridgian, Tithonian, Neocomian, and Aptian stages are documented in this collection. The data was compiled by researchers citing publications from the 1940s to 1958, primarily from the Bureau of Mineral Resources and West Australian Petroleum Pty Ltd. Work for this paper was largely completed before mid-1954, with the main record published in a bulletin substantiating earlier fossil findings.
From August 2023 through December 2025, a dataset of 329 employees was collected for a study on workplace democracy in relation to turnover intention. The study was authored by Steven Mellor and Ross Elliott, with planned journal submission in June 2026. The data is hosted on the Harvard Dataverse platform.
Geoscience Australia houses one of the world's largest collections of petroleum data. The collection includes digital well completion reports, well logs, core photography, and hard-copy data from the pre-digital era, submitted by industry under legislative requirements and gathered by government research projects.
Zewdu Tessema created a dataset of socio-demographic characteristics for key informants in selected organizations and public health facilities. The data covers the South Gondar Zone in the Amhara region of North central Ethiopia for the year 2022. The dataset contains 21 records and is available in XLS format under a CC-BY-4.0 license.
TiniX Vietnam OCR Annual Financial Statements is a text dataset containing OCR-extracted content from annual financial reports of companies listed in Vietnam. The collection includes 18,231 reports in Vietnamese corresponding to 1,491 different stock codes. The data was collected and processed by TiniX AI and was last updated on the platform in May 2026.
The 2019 B-Series De-trended Global Isostatic Residual Gravity Map of Australia is a Hue-Saturation-Intensity image derived from the 2019 Australian National Gravity Grids. It combines 1.4 million ground gravity observations, 345,000 line km of airborne gravity data, and 106,000 line km of gravity gradiometry data, sourced from government, industry, and research organizations from the 1940s onward. The map uses northwest shading and a linear color scale from -500 ยตm.s-2 to +500 ยตm.s-2.
114,971 files contain occurrence records, R scripts, and results for Technomyrmex albipes and T. brunneus. The dataset supports analyses of climatic niche divergence and future contact zones under CMIP6 climate scenarios for 2050 and 2090. It includes species distribution models built with Maxent using specific WorldClim bioclimatic variables for each species.