Description
5,000 AI-generated German passport images created for training optical character recognition (OCR) and computer vision models. Capture conditions vary across 3 angles, 4 lighting conditions, 4 backgrounds, and 2 distances, and structured metadata covers passport ID, gender, and age group. The dataset was created by ud-synthetic and was last updated on 2026-05-03.
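To get a feel for the size of the variation space described above, the four capture axes can be enumerated as a grid. This is a minimal sketch: the axis counts come from the description, but the integer indices are placeholders (the card does not name the individual conditions), and whether every combination actually appears in the dataset is not stated.

```python
from itertools import product

# Counts are taken from the description; the integer indices are placeholders
# because the card does not name the individual angles, lights, and so on.
VARIATION_COUNTS = {"angle": 3, "lighting": 4, "background": 4, "distance": 2}

def variation_grid(counts):
    """Return every combination of condition indices as a list of dicts."""
    axes = list(counts)
    ranges = [range(counts[axis]) for axis in axes]
    return [dict(zip(axes, combo)) for combo in product(*ranges)]

grid = variation_grid(VARIATION_COUNTS)
print(len(grid))  # 3 * 4 * 4 * 2 = 96
```

After download, comparing this grid against the conditions actually present is one way to check whether coverage is complete or sparse.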
Use Cases
Train optical character recognition models on synthetic passport text fields.
Develop computer vision models for document detection and classification using the varied angles and backgrounds.
Benchmark model robustness to lighting variation across the 4 lighting conditions.
Build privacy-preserving training pipelines that rely on synthetic, non-personal data.
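The lighting-robustness use case can be sketched as a per-condition accuracy breakdown. The record keys used here (`lighting`, `predicted`, `expected`) and the sample values are hypothetical, chosen only for illustration; the real metadata field names must be checked after download.

```python
from collections import defaultdict

def accuracy_by_condition(records, condition_key="lighting"):
    """Group exact-match accuracy by one capture condition.

    Each record is a dict with hypothetical keys: the condition label,
    the model's predicted text, and the expected text.
    """
    totals = defaultdict(int)
    correct = defaultdict(int)
    for rec in records:
        cond = rec[condition_key]
        totals[cond] += 1
        if rec["predicted"] == rec["expected"]:
            correct[cond] += 1
    return {cond: correct[cond] / totals[cond] for cond in totals}

# Toy records with made-up values, not taken from the dataset.
sample = [
    {"lighting": "light_0", "predicted": "X1", "expected": "X1"},
    {"lighting": "light_1", "predicted": "X2", "expected": "X2"},
    {"lighting": "light_1", "predicted": "Y3", "expected": "X3"},
]
print(accuracy_by_condition(sample))  # {'light_0': 1.0, 'light_1': 0.5}
```

Passing a different `condition_key` (for example an angle or background field, if present) gives the same breakdown along another axis.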
Strengths
All 5,000 images are AI-generated, so no real personal data is used.
Each image has multiple variations covering 3 angles, 4 lighting conditions, 4 backgrounds, and 2 distances.
Structured metadata includes fields like passport ID, gender, and age group.
Limitations
Column-level documentation is absent; field semantics must be inferred after download.
Row count is not reported, so the stated figure of 5,000 images cannot be confirmed before download; this may limit suitability assessment.
Last updated 2026-05-03 19:51:48; freshness should be verified.
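Because column-level documentation is absent, a quick pass over the downloaded metadata can help infer what each field holds. The sketch below assumes the metadata has already been loaded as a list of dicts; the field names and values in the toy rows are hypothetical and only mirror the fields named in the description.

```python
def summarize_fields(rows, max_examples=3):
    """Collect observed value types and a few example values per field."""
    summary = {}
    for row in rows:
        for field, value in row.items():
            info = summary.setdefault(field, {"types": set(), "examples": []})
            info["types"].add(type(value).__name__)
            # Keep a small number of distinct example values per field.
            if value not in info["examples"] and len(info["examples"]) < max_examples:
                info["examples"].append(value)
    return summary

# Toy rows with hypothetical field names and values.
rows = [
    {"passport_id": "P0001", "gender": "F", "age_group": "30-39"},
    {"passport_id": "P0002", "gender": "M", "age_group": "30-39"},
]
for field, info in summarize_fields(rows).items():
    print(field, sorted(info["types"]), info["examples"])
```

The same pass also yields the actual row count via `len(rows)`, which can be compared against the stated number of images.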
Provenance
Source
huggingface
Collection Method
AI-generated synthetic data.
Freshness
2026-05-03
License is unknown and should be verified before use.