Kaggle hosts this dataset, titled OmicsExpressionProteinCodingGenesTPMLogp1. The title suggests it contains omics expression data, likely transcripts per million (TPM) values for protein-coding genes, and the "Logp1" suffix suggests the values are stored as log(TPM + 1) rather than raw TPM. The dataset's author, organization, and collection details are unknown.
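If the "Logp1" suffix in the title does mean log(TPM + 1), raw TPM values can be recovered by inverting the transform. The sketch below is a minimal illustration under that assumption. The function name logp1_to_tpm is hypothetical, and the default base of 2 is a guess: the logarithm base is not documented and should be checked after download.

```python
import math


def logp1_to_tpm(value: float, base: float = 2.0) -> float:
    """Invert y = log_base(TPM + 1) to recover TPM.

    Assumption: the stored values are log(TPM + 1). The base is NOT
    documented for this dataset; 2.0 is only a placeholder default.
    """
    if value < 0:
        # log(TPM + 1) is never negative when TPM >= 0
        raise ValueError("log(TPM + 1) values cannot be negative")
    if base == math.e:
        # expm1 is more accurate than exp(x) - 1 for small x
        return math.expm1(value)
    return base ** value - 1.0
```

Under the base-2 assumption, logp1_to_tpm(3.0) returns 7.0, since 2^3 - 1 = 7.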
Use Cases
- Train a model to classify biological samples based on gene expression profiles (inferred from domain, verify after download)
- Perform differential expression analysis between experimental conditions (inferred from domain, verify after download)
- Build a predictive model linking gene expression to phenotypic traits (inferred from domain, verify after download)
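As a rough illustration of the differential expression use case, the sketch below compares one gene between two groups of samples. It assumes the values are already on a log scale, in which case the difference of group means is a log fold change of the (TPM + 1) geometric means. The function name mean_log_difference and the example numbers are hypothetical, and this is a descriptive quantity only, not a statistical test.

```python
from statistics import fmean
from typing import Sequence


def mean_log_difference(group_a: Sequence[float], group_b: Sequence[float]) -> float:
    """Difference of mean log-scale expression for one gene.

    Assumption: inputs are log(TPM + 1) values for the same gene,
    one value per sample, split into two experimental conditions.
    """
    if not group_a or not group_b:
        raise ValueError("both groups need at least one sample")
    return fmean(group_a) - fmean(group_b)
```

For example, mean_log_difference([3.0, 5.0], [1.0, 2.0]) returns 2.5. A real analysis would add a significance test and multiple-testing correction across genes.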
Strengths
- Published on Kaggle, a platform for sharing datasets.
Limitations
- Metadata is minimal; actual content requires verification after download.
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, so suitability for a given task cannot be judged before download.
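Most of these gaps can be closed with a quick inspection once the file is downloaded. The sketch below is one way to do that with the Python standard library. It assumes the data is a CSV file with a header row and a single leading identifier column, neither of which is confirmed. The function name summarize_expression_csv, the id_columns parameter, and the demo data are all hypothetical.

```python
import csv
import io
import math


def summarize_expression_csv(handle, id_columns: int = 1) -> dict:
    """Report row count, column count, and value range of a CSV.

    Assumptions: first line is a header; the first `id_columns`
    columns hold identifiers and the rest hold numeric expression.
    """
    reader = csv.reader(handle)
    header = next(reader, None)
    if header is None:
        raise ValueError("file is empty")
    n_rows = 0
    n_non_numeric = 0
    lowest, highest = math.inf, -math.inf
    for row in reader:
        if not row:
            continue  # skip blank lines
        n_rows += 1
        for cell in row[id_columns:]:
            try:
                value = float(cell)
            except ValueError:
                n_non_numeric += 1  # e.g. "NA" or an empty cell
                continue
            if math.isnan(value):
                n_non_numeric += 1
                continue
            lowest = min(lowest, value)
            highest = max(highest, value)
    has_values = lowest <= highest
    return {
        "n_rows": n_rows,
        "n_columns": len(header),
        "header_preview": header[:5],
        "min_value": lowest if has_values else None,
        "max_value": highest if has_values else None,
        "n_non_numeric": n_non_numeric,
    }


if __name__ == "__main__":
    # Synthetic stand-in for the real file; column names are made up.
    demo = io.StringIO("sample,GENE1,GENE2\ns1,0.0,3.5\ns2,1.25,NA\n")
    print(summarize_expression_csv(demo))
```

A negative min_value would argue against the log(TPM + 1) reading of the title, and a nonzero n_non_numeric count points to missing values or extra metadata columns that need handling before modeling.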