161 records contain 4,153 pages of declassified U.S. Department of War documents on UFO/UAP phenomena, re-extracted into cleaned Markdown with inline image captions. The dataset includes per-page JPEG renders and interactive 3D atlas components, representing data derived from 80 years of declassified material. All data is released under a CC0 license by author alex-zhang42, with a version dated 2026-05-08.
Use Cases
- Multimodal topic modeling based on interleaved image-text data from declassified documents.
- Historical timeline analysis based on 80 years of government document releases.
- Optical character recognition (OCR) and document structure analysis based on per-page JPEG renders.
- Training models to caption or describe historical photographs, sketches, and handwritten notes mentioned in the description.
Strengths
- Contains 4,153 pages of primary source material.
- Includes approximately 2 GB of image data alongside text.
- Covers an 80-year time range of declassified government documents.
- Released under a permissive CC0 license.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, which may limit suitability assessment.
- Freshness should be verified as the last update was 2026-05-09.
Provenance
- Source
- U.S. Department of War PURSUE Release 01 UFO / UAP declassification.
- Collection Method
- Re-extracted into cleaned Markdown with inline image captions.
- Time Range
- Data derived from documents spanning 80 years.
- Freshness
- Last updated 2026-05-09 06:40:22.