Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Training corpus for GO-GPT, an autoregressive transformer model for Gene Ontology term prediction. It contains proteins annotated with GO terms, InterPro domains, STRING protein-protein interactions, and metadata sourced from UniProt.
The full dataset description, specific file formats, size, row count, and license details are not provided in the input and require checking the linked dataset page.