Sign in to view source links and access this dataset
Description
AI-generated images of USA passports designed for training machine learning models. The dataset contains 9,600 synthetic images with varied angles, lighting, and backgrounds, created by ud-synthetic. It was last updated on May 3, 2026.
Use Cases
Train optical character recognition (OCR) models based on synthetic passport images.
Develop computer vision models for document detection and alignment based on varied angles and backgrounds.
Benchmark model robustness to different lighting conditions and distances mentioned in the description.
Create privacy-compliant training pipelines for identity verification systems using synthetic data.
Strengths
Contains 9,600 AI-generated images, providing a substantial volume for model training.
Includes structured metadata covering attributes like gender and age group.
All images are synthetically generated, ensuring no real personal data is used and addressing privacy concerns.
Images feature varied conditions such as angles, lighting, and backgrounds to aid model generalization.
Limitations
Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.
Last updated 2026-05-03 19:45:50; freshness should be verified.
Provenance
Source
ud-synthetic on Hugging Face
Collection Method
AI-generated synthetic data
Freshness
2026-05-03 19:45:50
Geography
USA (document format)
License is unknown and should be verified before use.