DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

Computer Graphics & Simulation Datasets | DataSalon

🎨

Computer Graphics & Simulation

3D models, rendered datasets, physics simulation, digital twins, synthetic data generation, game engine data

1,028 datasets

Synthetic E-commerce Behavior Dataset

A synthetic, tabular e-commerce behavior dataset intended for practicing machine learning classification and feature engineering. Hosted on Kaggle, it is pitched at intermediate-level data cleaning and storytelling tasks. No public documentation gives record counts or column definitions.

Tabular, E Commerce Services, Intermediate, Data Storytelling, Data Cleaning, +1

Industrial Point Cloud (Part 01)

Synthetic point cloud data across multimodal industrial categories. This dataset serves as the first installment of a series for 3D computer vision research in manufacturing environments.

Simulations, Object Detection, Segmentation, Synthetic, +1

Synthetic Employee Data for People Analytics

Synthetic data designed for practicing people analytics workflows. The dataset contains artificial employee records for modeling HR scenarios without privacy concerns. Specific row counts, column details, and authorship are not provided.

Synthetic, +1

AI Agent Security Threat Classification Data

This dataset is described as a security mesh for AI agents and is tagged for text classification tasks. The number of rows, the number of columns, and the specific data fields are unknown.

Text, Python, Classification, +1

GenS-Video-150K: Synthetic Video Frames Annotated for Sampling

A synthetic dataset of 150,000 video frames annotated by GPT-4o for training frame sampling models. It features dense coverage, annotating approximately 20% of all frames with relevance scores, and provides fine-grained confidence assessments on a 1 to 5 scale. The dataset was created by author yaolily and last updated on September 4, 2025.

Video, Relevance Scoring, Video Frame Sampling, Computer Vision, Large Scale, Synthetic Data, Synthetic, +1

Synthetic Dataset for Machine Learning Simulation

Synthetic Dataset is a dataset published on Kaggle. The dataset's content, size, and specific features are not described in the available metadata. Its creation method and intended application are inferred from its title and platform tag.

Tabular, Simulation, Synthetic Data, Synthetic, Computer Graphics, +1

Proobjaverse 300K: A Large-Scale Image-to-3D Dataset

Proobjaverse 300K is a dataset published on Hugging Face by Stable-X, last updated on 2026-01-27. The title and platform tags suggest it contains a large collection of images, likely 300,000 items, for image-to-image and image-to-3D tasks. Its specific content, columns, and file formats are not detailed in the provided metadata.

Image, Multimodal, Size Categories: 10K<n<100K, Image to 3D, Task Categories: image-to-3d, Image To Image, Language: en, Region: us, 3d-reconstruction, Task Categories: image-to-image, License: apache-2.0, Computer Graphics, +1

Uzbek Telegram-Style Messages for Spam Detection

This dataset contains 2,000 Uzbek text messages labeled as 'spam' or 'normal'. It is designed for training spam detection models and is split into 1,800 training and 200 test samples.

JSON, Size Categories: 1K<n<10K, Library: polars, Language: uz, Spam Detection, Modality: text, Task Ids: multi-class-classification, Low Resource NLP, Library: mlcroissant, Library: datasets, Library: pandas, Telegram, Region: us, Task Categories: text-classification, Synthetic Data, License: mit, Uzbek, +1

Novae: Spatial Transcriptomics and Protein Samples for Model Training

MICS-Lab released a full dataset for the Novae project on December 10, 2025. The collection includes spatial transcriptomics samples used to train the Novae model, protein samples referenced in the associated article, and some Visium and Visium HD samples. It also contains synthetic data samples.

Image, Multimodal, Protein Data, Bioinformatics, Computer Vision, Spatial Transcriptomics, Synthetic Data, Synthetic, +1

Photogrammetric Snow Depth Maps from Multiple Platforms in Davos

Snow depth maps and validation measurements from an intercomparison study of photogrammetric platforms, conducted in spring 2018 in the Dischma valley, Switzerland. Provided by EnviDat, the data set includes products from satellite, airplane, UAS, and terrestrial platforms.

Image, Geospatial, Geospatial Mapping, Alpine Environment, Photogrammetry, Snow Depth, +1

Canada Basin Zooplankton Species Abundance and Biomass from 2002

This dataset contains zooplankton samples collected in August 2002 from 10 stations in the Canada Basin using 53 and 236 µm mesh nets. It includes 1,164 rows documenting 30 species, with analysis of numerical dominance and biomass contributions. The data was collected by the organization SCIOPS for ocean exploration research.

Tabular, Zooplankton, Species Abundance, Arctic Ocean, Biomass, Marine Biology, +1

Neuston Tow Zooplankton Data from Oregon and California Coastal Cruises

Zooplankton samples were collected using a neuston net during four juvenile salmonid trawling cruises off the coasts of Oregon and California. The dataset, created by SCIOPS for the GLOBEC NEP Process Study, covers two sampling years, 2000 and 2002. Data includes genus/species-level identification with life stage and abundance information.

Tabular, Zooplankton, Coastal Ecology, Salmonid Survey, Marine Biology, +1

DensePose-COCO: Human Image-to-Surface Correspondence Annotations

DensePose-COCO is a large-scale ground-truth dataset with image-to-surface correspondences manually annotated on COCO images. It contains 33,929 samples and was created by Voxel51. The dataset was last updated on the Hugging Face platform in June 2024.

Image, Size Categories: 10K<n<100K, Library: fiftyone, Task Categories: object-detection, Language: en, License: cc-by-nc-2.0, Modality: image, Human Pose Estimation, Computer Vision, Object Detection, Keypoints, Region: us, Large Scale, arXiv: 1802.00434, Image Annotation, FiftyOne, +1

Synthetic Dataset for AI-Assisted Mass Customization

A synthetic dataset likely related to mass customization processes. The dataset is hosted on Kaggle and is tagged as 'Synthetic'. Specific details on volume, features, creation method, and authorship are not provided in the metadata.

Tabular, Product Design, AI-assisted, Mass Customization, Synthetic Data, Manufacturing, Synthetic, +1

Karenia Brevis Counts and Biochemistry from Florida Coastal Cruise 1998

Water bottle samples collected at 14 stations in Florida during a November 1998 cruise provide counts and biochemical analysis of the harmful alga Karenia brevis. Coulter counts were determined for the 14-28 µm size class, and isolated algal pellets were analyzed for total lipid, neutral lipid, free amino acids, protein, RNA, chlorophyll, and nitrate. The dataset was produced by Kamykowski's NCSU laboratory for NOAA NCEI.

Tabular, Harmful Algal Blooms, Oceanography, Water Quality, Biochemistry, Marine Biology, +1

Digital Twin PTB-XL: A Physics-Based Simulation Dataset

Digital Twin PTB-XL is a dataset published on Kaggle. The dataset likely contains data for physics-based modeling and simulation, given the 'Digital Twin' concept in its title. Specific details regarding its size, origin, and creation date are not provided in the available metadata.

Tabular, Digital Twin, Physics Based Modeling, Simulation, +1

NSD S1: Selected Voxel Data for Train/Val Split

The dataset title 'NSD S1 train val NC selected voxels 70' suggests it contains data from the Natural Scenes Dataset (NSD) project. It likely includes selected voxel data from 70 subjects for training and validation splits. The data is hosted on Kaggle, but detailed metadata is unavailable.

Tabular, Voxel Data, fMRI, Neuroscience, Brain Imaging, +1

NSD S2: Selected Voxel Data for Training and Validation

NSD S2 likely contains processed neuroimaging data from the Natural Scenes Dataset. The dataset appears to be a subset of voxel data curated for machine learning training and validation purposes. Published on Kaggle, its specific content and scale require verification after download.

Tabular, Functional MRI, Voxel Data, Neuroimaging, Brain Scan, +1

Zomato Cart Add-On Synthetic Dataset

A synthetic dataset likely modeling customer interactions with add-on items in a food delivery cart, sourced from Kaggle. The dataset's specific size, creator, and temporal coverage are not provided in the metadata. Its content and structure must be verified after download.

Tabular, E Commerce, Food Delivery, Cart Analysis, Synthetic Data, Synthetic, +1

NSD S6: Voxel Activity Training Data for Natural Scenes

NSD S6 val train NC selected voxels 70 is a dataset hosted on Kaggle. The title suggests it contains selected voxel data, likely from functional magnetic resonance imaging (fMRI), for training models related to the Natural Scenes Dataset. The specific content, scale, and origin require verification after download.

Tabular, Voxel Data, fMRI, Neuroscience, Neural Activity, Brain Imaging, +1
