Loading...
Loading...
Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
14,835 datasets
A large-scale synthetic dataset of physically-simulated multi-object interaction scenes generated using NVIDIA Isaac Sim and the PhysX physics engine. It is designed to train and evaluate AI models on physical reasoning, rigid body dynamics, optical flow, depth estimation, and scene understanding.
Geoscience Australia's compilation of ship-track gravity, magnetic, and bathymetry data, levelled to reduce errors between surveys from the 1970s onward. The data is organized by survey in ASCII files with records spaced approximately every 150 meters, containing both original and levelled measurements. This levelling work was performed by Desmond FitzGerald & Associates and Ron Hackney between 1998 and 2011, with final review by Yvette Poudjom Djomani.
HeiGIT produced this dataset mapping approximately 8,000 km of arterial roads in Laos using 2020 and 2024 PlanetScope satellite imagery. It provides AI-derived classifications for road surface, width, and humanitarian passability for motorway, trunk, primary, and secondary road segments.
The SNOWS-3 survey is a deep-seismic acquisition program conducted by the Australian Geological Survey Organisation (AGSO) in the second half of 1993. It aimed to acquire at least 3000 km of high-quality seismic data in the offshore Canning Basin to determine the regional structural framework and deep crustal structure of Australia's southern North West Shelf. The survey was designed to tie in with previous surveys and link principal exploration wells in the region.
A dataset from the Government Digital Service tracking the percentage of Subject Access Requests (SARs) processed on time by UK public authorities year-to-date. The data relates to compliance with the Data Protection Act and the Freedom of Information Act 2000, which grants individuals the right to access personal data and public information. The dataset is provided in CSV format under the OGL-UK-3.0 license.
Meta-analysis data aggregates results from three randomized controlled trials (CLASSIC, CLEAR, ADVOCATE) involving 440 patients with ANCA-associated vasculitis. The dataset includes primary endpoints for clinical response, remission rates, renal function changes, and safety outcomes like adverse events. It was authored by figshare admin karger and last updated in April 2026.
101 indoor scenes contain 5,000 high-resolution stereo image pairs labeled with millimeter-level ground truth depth and disparity. The dataset, created by IlyaInd, includes paired pinhole and fisheye samples across varying fields of view. It was last updated on the Hugging Face platform in May 2026.
Two tables provide metabolomics data for 12 rat brain samples across hippocampus and cortex regions. The data, covering amino acids, organic acids, sugars, lipids, and phosphorylated compounds, reveals significant concentration differences between brain regions and experimental groups. Linxia Li published this 14.8 KB dataset on figshare under a CC-BY-4.0 license, last updated in April 2026.
A global dataset integrates meta-analysis and 13C-tracer experiments to quantify organic acid-induced abiotic destabilization of soil carbon. It contains model inputs and results showing this pathway contributes 9% ± 1% of global soil organic carbon mineralization. The dataset was created by Zhenhui Jiang and published on figshare in April 2026.
Our World in Data provides 8,453 observations on the Age of Full Democracy across 49 Asian countries. The dataset spans the years 1789 to 2025 and was repackaged by Electric Sheep Asia. It is licensed under CC BY 4.0.
A study investigated background levels and uptake rates of organophilic metals, particularly selenium, in ten streams draining portions of the Yukon-Tanana Terrane and Cassiar Platform between Ross River and Watson Lake, Yukon. Water, sediments, benthic invertebrates, and fish (slimy sculpin) were sampled and analyzed for metals concentrations, and benthic invertebrate community composition was documented. The findings provide baseline information on natural metals concentrations in an aquatic system of interest for mineral exploration.
A study from 2008 and 2007 designed to fill an information gap on metal speciation in northern streams. It was conducted by the Government of Yukon at six discrete watercourse sites over two seasons to understand metal mobility and exposure in aquatic environments.
OmniRooms is a large-scale synthetic indoor dataset for panoramic 3D perception, depth estimation, and monocular view synthesis. It contains 16 large indoor scenes with multiple rooms, totaling 271,000 equirectangular RGB panoramas. The dataset was created by Insta360-Research and was last updated in June 2026.
A dataset of monocratic decisions from Brazil's Superior Court of Justice (STJ) concerning the civil liability of shopping centers for violent crimes committed by third parties on their premises. It contains legal texts in PDF and DOC formats totaling 75.6 KB, published under a CC-BY-4.0 license by Renata Dantas de Oliveira Mercadante. The dataset was last updated on May 11, 2026.
Personnel of the BOREAS Information System compiled geographic coordinate and site characteristic information from several sources throughout the experiment period. The final set is organized into two data sets providing coordinates for single sites and corner coordinates for standard geographic areas. Data are stored in two ASCII text files.
A map showing the toxic pressure from mixtures of other organic substances on Dutch surface water, based on samples taken between 2013 and 2018. The data, provided by the Dutch Ministry of the Interior and Kingdom Relations, classifies toxic pressure into five increasing levels to indicate the potential impact on aquatic life communities. This information serves as a signal for measures needed to make water suitable for social purposes like drinking and irrigation.
Global Navigation Satellite System (GNSS) data provides autonomous geo-spatial positioning with worldwide coverage from a network of permanent ground-based receivers. The dataset, managed by NASA's Crustal Dynamics Data Information System (CDDIS), includes hourly files of 30-second sampled observations, broadcast ephemeris, and meteorological messages in RINEX format. It archives data primarily from GPS and GLONASS, with additional systems like Galileo and Beidou added since 2011.
China's soil organic carbon sequestration data, likely derived from a study on bare land restoration. The dataset is 7.4 MB in size and was published by Yinchuan Xiang on figshare under a CC-BY-4.0 license. It was last updated on May 9, 2026.
Australia's gravity anomalies are visualized in a Hue-Saturation-Intensity image derived from the 2019 B-Series National Gravity Grids. The map combines over 1.4 million ground observations, 345,000 line km of airborne gravity, and 106,000 line km of gradiometry data, supplemented by a global gravity grid. Data collection spans from the 1940s to 2019, sourced from government, industry, and research organizations.
Public Accounts of Canada data lists payments for professional and special services aggregating to $100,000 or more to a single payee. This detailed listing includes the service classification, payee name and location, and total amount paid. The dataset is provided by Public Services and Procurement Canada and is updated annually, with the latest metadata from April 2026.