Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Protenix Data is ByteDance's open-source, preprocessed training dataset for its Protenix model, a PyTorch reproduction of AlphaFold3. The dataset is described as one of the largest publicly available AF3-style training corpora, built from the wwPDB. LiteFold released the dataset on Hugging Face, with a last recorded update on 2026-05-27.
The dataset's license is unknown, which may impose usage restrictions.