Data Sheet 1_Evolutionary origin and asymmetric subgenomic retention of the lncRNA pGhFAD2
by Haihong Chen · Updated 1mo ago
47.5 KB · 1 file
Available on 1 platform
Description
This research document by Haihong Chen, last updated May 8, 2026, investigates the evolutionary origin of the long non-coding RNA (lncRNA) pGhFAD2-1 in cotton. The study applies a cross-genomic comparative strategy to 27 Gossypium species representing eight genome types (A, B, D, E, F, G, K, and AD). It reports that the lncRNA originated from a tandem duplication event approximately 5 million years ago, followed by a 1,221-bp insertion that abolished protein-coding capacity.
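As a rough illustration of how an insertion can remove coding capacity, the sketch below compares the longest open reading frame (ORF) before and after an insertion. Everything in it is hypothetical: the sequences are toy strings, not pGhFAD2-1 or FAD2 data, and `longest_orf_length` is an illustrative helper, not the method used in the study. Note that 1,221 is a multiple of three, so an insertion of that length would not shift the reading frame on its own. The listing does not state the mechanism, so the toy case assumes, purely for illustration, an in-frame stop codon carried by the inserted sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_orf_length(seq: str) -> int:
    """Length in codons (ATG through the last sense codon) of the longest
    stop-terminated ORF on the forward strand, over all three frames."""
    seq = seq.upper()
    best = 0
    for frame in range(3):
        start = None  # index of the ATG that opened the current ORF, if any
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None:
                if codon == "ATG":
                    start = i
            elif codon in STOP_CODONS:
                best = max(best, (i - start) // 3)
                start = None
    return best


# Toy data (assumption): a short intact ORF and a frame-preserving insertion
# that happens to carry an in-frame stop codon (TAG).
INTACT = "ATG" + "GCT" * 10 + "TAA"
INSERTION = "GCTTAGGCT"          # length 9, a multiple of 3 like 1,221
INSERT_AT = 15                   # codon boundary inside the ORF
DISRUPTED = INTACT[:INSERT_AT] + INSERTION + INTACT[INSERT_AT:]

if __name__ == "__main__":
    print("intact ORF (codons):   ", longest_orf_length(INTACT))
    print("disrupted ORF (codons):", longest_orf_length(DISRUPTED))
```

With real data, the same comparison would be run on the coding paralog and the lncRNA locus extracted from the assemblies, ideally on both strands and with a dedicated ORF or coding-potential tool rather than this minimal scan.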
Use Cases
Study lineage-specific lncRNA evolution using the comparative analysis across 27 Gossypium species.
Analyze the impact of structural variations like the 1,221-bp insertion on gene function and regulatory networks.
Investigate subgenomic conservation patterns in allopolyploid crops, using the D-genome-specific retention the study describes as a reference case.
Research the role of purifying selection in maintaining functional domains in non-coding RNAs.
Strengths
Builds on a clearly scoped comparative analysis of 27 Gossypium species representing eight genome types.
Provides specific evolutionary timelines, such as an origin event approximately 5 million years ago.
Includes a concrete molecular event description: a 1,221-bp exogenous sequence insertion.
Released under a permissive CC-BY-4.0 license for reuse.
Limitations
The dataset is a single 47.5 KB DOC file, which suggests a manuscript or supplementary document rather than a primary data repository.
Row count and column-level documentation are absent; data structure and semantics must be inferred from the document text.
The described content focuses on a single lncRNA, which limits how far it generalizes.