Betula Platyphylla ELIP and LHC Protein Sequence Data and Structural Models
by preetom regon·Updated 1mo ago
1.0 MB11files
Available on 1 platform
Sign in to view source links and access this dataset
Description
1.0 MB of sequence data, structural models, and analysis outputs for Early Light-Inducible Proteins (ELIPs) and the Light-Harvesting Complex (LHC) superfamily in Betula platyphylla. The dataset was authored by preetom regon and last updated on 2026-05-03. Files include PDF, CIF, TXT, FA, HMM, FASTA, and TREEFILE formats.
Use Cases
Predicting protein-DNA interactions based on sequence data and structural models.
Comparative genomic analysis of the Light-Harvesting Complex (LHC) superfamily.
Building phylogenetic trees using the provided TREEFILE data.
Training or validating protein structure prediction models with the CIF structural files.
Strengths
Data is openly licensed under CC-BY-4.0.
Includes multiple analysis-ready file formats like FASTA and HMM.
Contains structural models in CIF format alongside sequence data.
Limitations
Row count is unknown, which may limit suitability assessment.
Column-level documentation is absent; field semantics must be inferred after download.
Description metadata is limited; actual data quality requires manual inspection after download.
Provenance
Source
figshare
Collection Method
Associated with a study on ELIPs and the LHC superfamily; specific collection method is not detailed.
Freshness
Last updated 2026-05-03 07:27:41; freshness should be verified.
The 1.0 MB size indicates a small dataset, which may limit the scope of analysis.