Description
Official ERYON ingestion and preprocessing repository for medical data. The repository contains scripts that process raw DICOM files, whole slide images (WSIs), and genomics archives into intermediate formats such as PNGs, tiles, and embeddings. It was authored by Chucks90 and last updated on June 2, 2026.
Use Cases
Develop medical image preprocessing workflows modeled on the repository's DICOM-to-PNG conversion scripts.
Create tiling and embedding pipelines for whole slide images (WSIs) that follow the repository's storage architecture.
Build multimodal data integration systems that combine raw DICOM, WSI, and genomics data.
Test data ingestion and transformation scripts for medical AI projects against the provided repository structure.
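As a rough illustration of the WSI tiling use case, the sketch below enumerates tile coordinates over a slide of known pixel dimensions. It is not taken from the ERYON scripts: the function name `tile_grid`, the default tile size of 256, and the choice to clip edge tiles rather than pad them are all assumptions made for the example.

```python
from typing import Iterator, Optional, Tuple

# Hypothetical helper, not part of the ERYON repository.
def tile_grid(
    width: int,
    height: int,
    tile_size: int = 256,          # assumed default; the repository's value is unknown
    stride: Optional[int] = None,  # defaults to tile_size (non-overlapping tiles)
) -> Iterator[Tuple[int, int, int, int]]:
    """Yield (x, y, w, h) boxes that cover a width x height image.

    Tiles on the right and bottom edges are clipped to the image bounds.
    """
    step = tile_size if stride is None else stride
    if width <= 0 or height <= 0 or tile_size <= 0 or step <= 0:
        raise ValueError("width, height, tile_size and stride must be positive")
    for y in range(0, height, step):
        for x in range(0, width, step):
            yield (x, y, min(tile_size, width - x), min(tile_size, height - y))


if __name__ == "__main__":
    for box in tile_grid(600, 300, tile_size=256):
        print(box)
```

Each box could then be passed to whatever slide reader the pipeline uses to extract the pixel region; that step is left out here because the repository's reader is not documented in this listing.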
Strengths
Repository structure is explicitly documented with a clear storage architecture for raw, interim, and processed data.
The description specifies support for multiple medical data modalities: DICOM, whole slide images (WSIs), and genomics.
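To make the raw, interim, and processed layout concrete, here is a minimal sketch of how a raw file might be routed to an interim location by modality. Only the three stage names and the three modalities come from the description; the `data` root, the per-modality subdirectory names, the file extensions, and the function `interim_path` are assumptions and may not match the actual repository.

```python
from pathlib import PurePosixPath

# Assumed mapping from file extension to modality; the real repository
# may recognize different or additional extensions.
MODALITY_BY_SUFFIX = {
    ".dcm": "dicom",
    ".svs": "wsi",
    ".tiff": "wsi",
    ".vcf": "genomics",
}

# Hypothetical helper, not part of the ERYON repository.
def interim_path(raw_path: str, root: str = "data") -> PurePosixPath:
    """Map a raw input file to an interim output directory for its modality."""
    path = PurePosixPath(raw_path)
    modality = MODALITY_BY_SUFFIX.get(path.suffix.lower())
    if modality is None:
        raise ValueError(f"unrecognized file type: {path.name}")
    # e.g. data/interim/dicom/scan001 would hold the PNGs derived from scan001.dcm
    return PurePosixPath(root) / "interim" / modality / path.stem


if __name__ == "__main__":
    print(interim_path("data/raw/dicom/scan001.dcm"))
    print(interim_path("data/raw/wsi/slide42.svs"))
```

Keeping the mapping in a single table makes it easy to compare against the repository's actual structure once it has been inspected after download.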
Limitations
Description metadata is limited; actual data quality requires manual inspection after download.
Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.
Provenance
Source
huggingface
Collection Method
Official ingestion and preprocessing scripts for ERYON.
Freshness
Last updated 2026-06-02 16:34:06; freshness should be verified.
License is unknown; terms of use must be verified before application.