Loading...
Loading...
3D models, rendered datasets, physics simulation, digital twins, synthetic data generation, game engine data
1,021 datasets
NeuroData provides multiple neuroimaging datasets stored as Neuroglancer Precomputed Volumes. The collection spans multiple modalities and scales, from nanoscale electron microscopy to mesoscale structural and functional MRI. Many datasets include segmentations and meshes.
Actionbench is a benchmark of 128 paired video and animated point-cloud samples created by Meta (Facebook) for evaluating 3D mesh generation. Each record contains 16 RGBA frames with alpha masks and a corresponding animated point cloud consisting of 16 keyframes sampled from mesh surfaces.
August to September 2005 survey collected seafloor and subsurface data off Scotland's west coast aboard the RSS Charles Darwin. The British Geological Survey conducted the work across UK and Irish waters between 55–57°N and 8–14°W. Data types include bathymetry, seismic, magnetic, and gravity measurements.
A synthetic dataset for training hotword or keyword detection models. The dataset is published on Kaggle and is described as a training set for the 'Destiny' hotword. The specific data volume, creation date, and author are unknown.
10,000 rows of synthetic cafe sales data designed for data cleaning training. The dataset is synthetic and likely contains intentionally introduced errors or inconsistencies to simulate real-world messy data. Its author, organization, and license are unknown.
NVIDIA's SAGE-10k, released in February 2026, consists of 10,000 interactive indoor scenes generated through an agentic-driven pipeline for embodied AI research. The collection spans 50 room types and styles, featuring 565,000 uniquely generated 3D objects designed for interactive simulation.
pdftools provides utilities for extracting text, fonts, attachments, and metadata from PDF files. The tool also supports rendering PDF documents into image formats like PNG, JPEG, and TIFF, or into raw bitmap vectors for further processing in R. It is authored by Jeroen Ooms and is based on the 'libpoppler' library.
Large Scale Student Performance Synthetic Dataset is a synthetic dataset published on Kaggle. The raw description suggests it relates to big data infrastructure. The dataset's actual size, structure, and specific attributes are unknown from the provided metadata.
Yellow-HAR3D is a dataset of dense LiDAR point clouds. Published on Kaggle, its specific scale, collection method, and temporal coverage are not detailed in the available metadata. The dataset's content and potential applications must be verified after download.
CodeX-7M-Non-Thinking is a dataset curated from high-quality public sources and enhanced with synthetic data from both closed and open-source models. It is part of the CodeX lineup by Modotte, with a focus on providing data for model training and fine-tuning. The dataset was last updated on February 10, 2026.
Joint Nature Conservation Committee led the R/V Celtic Explorer cruise CE0705 from 4th to 18th June 2007. The survey acquired high-resolution multibeam, sub-bottom profiler, and camera data in the SW Approaches area, approximately 320km southwest of Land's End, to map morphology and investigate biological communities for Special Areas of Conservation assessment.
Julien Moeys developed 'The Soil Texture Wizard', a set of R functions for plotting, classifying, and transforming soil textures data. The package includes predefined texture triangles from more than 15 classification systems used around the world. It provides a graphical user interface and supports plotting into various triangle geometries.
A project by the British Geological Survey aims to generate high-resolution, accurate flow and transport simulation datasets for numerous geological realizations. These datasets will test upscaling methods from physical sciences literature and identify improved scaling laws for strongly heterogeneous systems. The work focuses on quantifying transport scaling in structures with high variance and strong textures observed in actual geological systems.
Ethiopian seismic data from station FURI, recorded over a decade, is used to demonstrate a new methodology for measuring shear-wave attenuation anisotropy. The package includes synthetic data, analysis codes, and derived measurements as described in Asplet et al 2024. The data is provided by the British Geological Survey (BGS) and was last updated in March 2026.
MeshGraphNetAirfoil is a dataset hosted on Kaggle. Its title suggests it contains data related to airfoil simulations, likely intended for use with MeshGraphNet, a graph neural network architecture. The dataset's specific contents, scale, and origin are not detailed in the available metadata.
Kaggle hosts a synthetic dataset designed for predicting admission outcomes to Ivy League universities. The dataset is described as realistic but synthetic, meaning it is artificially generated to mimic real-world admission data. Details about its creator, size, and specific features are unknown.
887,321 synthetic records across 423,883 unique queries form this dataset for advancing competitive programming. It is designed for supervised fine-tuning and curated by state-of-the-art reasoning models. The dataset was authored by IIGroup and last updated on February 7, 2026.
This dataset supports research on mitigating terrain shadows in very high-resolution satellite imagery for land cover mapping. It contains WorldView-2 multispectral and WorldView-1 panchromatic images, fused to improve evergreen conifer detection in temperate mixed mountain forests. The underlying research was published in the International Journal of Applied Earth Observation and Geoinformation in October 2024.
NeuralTextures likely contains synthetic image data generated for computer graphics and vision tasks. The dataset is hosted on Kaggle, but its specific contents, size, and creator are not detailed in the provided metadata. Users should download the dataset to verify its exact composition and scale.
A collection of data from a study investigating the use of virtual reality (VR)-based exposure therapy to reduce mobility-related anxiety in lower-limb prosthesis users (LLPU). The study compares responses of LLPU to those of control groups of young and older adults without prostheses across repeated exposures to anxiety-inducing, high-elevation VR settings. The project aims to identify targets for individualized rehabilitation to reduce fall risk and improve balance confidence.