Igor Dolgalev provides the Molecular Signatures Database (MSigDB) gene sets as an R data frame. The package includes human genes and corresponding symbols and IDs for frequently studied model organisms such as mouse, rat, pig, fly, and yeast. The gene sets are typically used with the Gene Set Enrichment Analysis (GSEA) software.
Use Cases
- Perform gene set enrichment analysis (GSEA) based on the provided MSigDB gene sets.
- Cross-reference gene symbols and IDs across multiple model organisms based on the included organism mappings.
- Integrate curated gene sets into bioinformatics pipelines based on the tidy R data frame format.
Strengths
- Includes gene sets from the widely-used Molecular Signatures Database (MSigDB).
- Provides gene symbol and ID mappings for multiple model organisms, including mouse, rat, pig, fly, and yeast.
- Formatted as a tidy R data frame, which likely facilitates integration with the R/tidyverse ecosystem.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, which may limit suitability assessment.
- Last update date is unknown; freshness unverified.
Provenance
- Source
- Molecular Signatures Database (MSigDB)
- Collection Method
- Packaged as an R data frame by Igor Dolgalev.
- Time Range
- null
- Freshness
- Last updated date is unknown.
- Geography
- null