Fengjun Zhang published a dataset on figshare in May 2026 containing bioinformatics analysis results for Chronic Obstructive Pulmonary Disease (COPD). The dataset likely contains gene expression data and biomarker information derived from combined datasets GSE37768 and GSE38974. The file is 5.5 KB in size and is available in XLS format under a CC-BY-4.0 license.
Use Cases
- Validate diagnostic biomarkers for COPD based on identified genes ITGB2 and HNRNPAB.
- Analyze immune cell composition in COPD samples using CIBERSORT methodology mentioned in the description.
- Build machine learning models for early COPD diagnosis based on differential gene expression patterns.
- Investigate gene regulatory networks and pathway involvement for COPD-related targets.
- Perform Mendelian randomization analysis on COPD genetic data referenced in the study.
Strengths
- Dataset is derived from two combined gene expression datasets (GSE37768 and GSE38974).
- Results were validated using Polymerase Chain Reaction (PCR), immunohistochemistry (IHC), and immunofluorescence (IF) experiments.
- The dataset is licensed under CC-BY-4.0, allowing for open reuse.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, which may limit suitability assessment.
- The dataset is very small (5.5 KB), indicating limited scope.
Provenance
- Source
- figshare
- Collection Method
- Data from combined datasets GSE37768 and GSE38974 were analyzed using differential gene expression analysis, weighted gene co-expression network analysis (WGCNA), functional enrichment analysis, and machine learning techniques.
- Freshness
- Last updated 2026-05-21 17:29:50; freshness should be verified.