DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.


Computer Graphics & Simulation Datasets | DataSalon


🎨

Computer Graphics & Simulation

3D models, rendered datasets, physics simulation, digital twins, synthetic data generation, game engine data

1,034 datasets

Monthly Ocean Current Statistics for Japan Seas, 1953-1994

The SCIOPS dataset provides monthly statistical summaries of surface ocean currents in the seas adjacent to Japan from 1953 to 1994. The data, derived from GEK and ADCP instruments, are aggregated into 1-degree latitude/longitude grids. Each grid cell includes mean speed, mean direction, sample count, maximum and minimum current, and stability.

Tabular, Audio, Time Series, Geospatial, Marine Data, Marine Science, Surface Currents, Historical Data, Statistics, Geospatial Statistics, Ocean Currents, +1


BlockGen-3D: Large-Scale Voxelized 3D Models with Text Descriptions

BlockGen-3D is a large-scale dataset of voxelized 3D models with accompanying text descriptions, designed for text-to-3D generation tasks. The dataset was created by author PeterAM4, who processed and voxelized models from the Objaverse dataset to create a standardized representation suitable for training 3D diffusion models. It was last updated on January 9.

Multimodal, Voxel Data, Computer Vision, Text to 3D, Large Scale, 3d-models, Diffusion Models, +1


PartObjaverse-Tiny: 200 Complex 3D Objects with Part Annotations

PartObjaverse-Tiny is a 3D part segmentation dataset providing detailed semantic-level and instance-level part annotations for 200 complex 3D objects. It was created by yhyang-myron and was last updated on December 13, 2024. The dataset includes mesh files and corresponding ground truth annotation files.

Point Cloud, Part Segmentation, Semantic Annotation, 3d Objects, Computer Vision, +1


S2O: Static to Openable Enhancement for Articulated 3D Objects

3dlg-hcvc provides mesh, point cloud, and metadata for two datasets used in the S2O research project. The PM-Openable subset contains 648 openable objects from PartNet-Mobility, with a train/val/test split of 460/95/93 objects. The Articulated Container Dataset (ACD) contains openable container objects sourced from HSSD.

Point Cloud, Multimodal, 3d Objects, Articulated Objects, Computer Vision, Mesh Data, +1


Describable Textures Dataset: Textural Images with Human-Centric Attributes

The Describable Textures Dataset (DTD) is an evolving collection of textural images annotated with human-centric perceptual attributes. It is made available to the computer vision community for research purposes by the Visual Geometry Group at the University of Oxford. The dataset was last updated on the Hugging Face platform on 2023-05-11.

Image, Perceptual Attributes, Computer Vision, Texture Recognition, +1


Objaverse-XL: Over 10 Million 3D Objects for AI Training

Over 10 million 3D objects form this open dataset, which is more than an order of magnitude larger than its predecessor. AllenAI released Objaverse-XL in 2023 to train the Zero123-XL foundation model for 3D tasks. The dataset is described as being much more diverse than the earlier 800K-object Objaverse 1.0.

Point Cloud, Multimodal, Language: en, 3d Objects, Foundation Models, Region: us, Large Scale, License: odc-by, Synthetic Data, Computer Graphics, +1


Ling-Coder-DPO: 250k Samples for Code Model Preference Tuning

Ling-Coder-DPO is a subset of 250,000 samples used for Direct Preference Optimization (DPO) training of the Ling-Coder Lite model. The dataset was created by inclusionAI and last updated on Hugging Face on March 27, 2025. It is part of a larger collection that also includes a supervised fine-tuning (SFT) subset with over 5 million samples and a synthetic question-answering subset.

Text, Ai Training, Code Generation, Large Scale, Synthetic Data, Synthetic, +1


Urban Development Plan for Blumenstrasse Im Almeshofen, Puettlingen, Germany

A 2024 geospatial dataset from the German Federal Agency for Cartography and Geodesy. It contains development plans and surrounding areas for the Blumenstrasse Im Almeshofen site in the Herchenbach district of Puettlingen, Saarland. The data is provided as a Web Map Service (WMS) layer under a CC0-1.0 license.

Geospatial, 🇩🇪 Germany, Development Plans, Urban Planning, Municipal Data, +1


APISR: Anime Super Resolution Training Dataset

An image dataset for training models in anime super-resolution, created by HikariDawn. The dataset is associated with a research paper and a Gradio demo. It was last updated on October 24, 2025.

Image, Training Data, Anime Super Resolution, Computer Vision, +1


Weighted Mean Salinity from Delft3D FM Simulations for the Mississippi River Delta

Weighted average salinity outputs from two 31-day Delft3D Flexible Mesh simulations representing low and high discharge seasons in the Mississippi River Delta. The dataset, produced by ORNL_CLOUD and published via NASA EarthData, models conditions from fall and spring 2021. Data is provided in netCDF format, with each model's contribution weighted by the probability density function of Atchafalaya River discharge.

Time Series, Geospatial, Salinity, Hydrological Modeling, River Discharge, Coastal Dynamics, +1


Refusal XL: 16,000 Synthetic Instruction-Refusal Pairs

16,000 single-turn conversations form this synthetic dataset of instruction and refusal pairs. The dataset was created by author mrfakename and last updated on 2024-04-26. Human prompts are sourced from the Capybara dataset, with refusals generated synthetically.

Text, Instruction Response, Nlp Training, Refusal Generation, Synthetic Data, Synthetic, +1


Synthetic Electrical Network Models for San Francisco, Greensboro, and Austin

Synthetic Models for Advanced, Realistic Testing: Distribution systems and Scenarios (SMART-DS) provides realistic large-scale U.S. electrical distribution models for three metropolitan areas: San Francisco (SFO), Greensboro (GSO), and Austin (AUS). The dataset contains detailed network models and connected time-series loads, validated against thousands of utility feeders for operational similarity. It is intended for powerflow simulations under various scenarios.

Parquet, Realistic, Big Data, Energy Systems Integration, Powerflow, Electrical, Sfo, Grid Modernization, Distribution System, Aus, Scenario, Load Timeseries, Grid, Electrical Network, Smart Ds, Power, Energy, Distribution, Gso, Opendss, +1


Aria Synthetic Indoor Scenes for 3D Understanding

100,000 procedurally generated indoor scenes comprise this synthetic dataset. It was created by projectaria for research on 3D scene understanding, object detection, and tracking, and was last updated in September 2024. The dataset simulates sensor data matching the characteristics of Project Aria glasses.

Point Cloud, Multimodal, 3d Scene Understanding, Computer Vision, Object Detection, Large Scale, Synthetic Data, Synthetic, +1


ShapeNetSem: 3D Object Models Annotated with Physical Attributes

ShapeNetSem is a subset of the ShapeNet repository, containing 3D models with rich physical attribute annotations. The archive is hosted by ShapeNet and was last updated on Hugging Face in September 2023. Users must agree to specific terms of use, restricting redistribution to research associates who also agree to the terms.

Point Cloud, 3D Shapes, License: other, Language: en, Physical Attributes, Computer Vision, Region: us, arXiv:1512.03012, 3D shapes, +1


WebSight: 1-10 Million Synthetic Website Screenshots and Code Pairs

WebSight contains between 1 and 10 million pairs of synthetic website screenshots and their corresponding HTML/CSS code, released by HuggingFaceM4 in March 2024. The collection features two distinct versions covering standard HTML/CSS and modern HTML/Tailwind CSS implementations for English-language websites.

Parquet, arXiv:2403.09029, Library: polars, Library: dask, Size Categories: 1M<n<10M, Language: en, Modality: text, Code, Library: mlcroissant, Modality: image, Library: datasets, License: cc-by-4.0, Region: us, Synthetic, +1


ShapeSplatsV1: 52,000 3D Objects as Gaussian Splats from ShapeNetCore

ShapeSplatsV1 is a dataset of 52,000 3D objects across 55 categories, derived from the ShapeNetCore repository. The data is distributed as PLY files where each Gaussian splat's information is encoded in custom vertex attributes. The dataset was created by ShapeNet and last updated on Hugging Face in September 2024.

Point Cloud, 3d Objects, Gaussian Splats, Computer Graphics, Shape Net, +1


Gulf of Alaska Zooplankton Abundance from MOCNESS Trawls

MOCNESS trawl data capture zooplankton species abundance and biomass in the Gulf of Alaska from 1997 to 2004. The dataset, part of the Gulf of Alaska Long-Term Observation Program, was collected by SCIOPS using 1-square-meter nets with 5 mm mesh on oblique hauls. It provides a multi-year record for ecological analysis.

Tabular, Time Series, Long Term Ecological Data, Zooplankton Abundance, Gulf Of Alaska, Marine Biology, Ecological Monitoring, +1


Global Plankton Taxa and Counts from Net and Trap Collections, 1928-1987

Plankton counts and taxonomic data were collected over a 59-year period from 1928 to 1987 using nets and traps on vessels worldwide. The Smithsonian Oceanographic Sorting Center compiled these records, which include gear specifications like net mouth diameter and mesh size. NOAA's National Centers for Environmental Information (NCEI) holds the dataset, which was submitted for archival in 1994.

Tabular, Oceanography, Historical Data, Marine Biology, +1


3DWF: 3D Face Point Clouds with RGB-D Images and Subject Demographics

A collection of RGB-D camera captures from 92 subjects changing their pose based on 10 markers. The dataset includes images, depth maps, rotation and translation matrices for registration, reconstructed 2K-point clouds, high-definition initial point clouds, and subject characterizations by age, gender, and ethnicity. It was authored by Marcos Quintana González and last updated in May 2024.

Image, Point Cloud, Rgb D, Biometrics, Computer Vision, 3d Faces, +1


Depth Perception Data from Naturalistic Images with Simulated Blur

This dataset contains experimental data from a study of depth perception in images of real scenes. The study manipulated pictorial depth cues, simulated dioptric blur, and binocular disparity, using light field photographs taken with a Lytro plenoptic camera that can capture images at up to 12 focal planes. Observers performed 2AFC tasks, indicating which of two patches extracted from these images was farther away.
