Compass is a collection of processed 16S rRNA and shotgun-derived microbiome tables formatted for machine learning. The dataset is organized into benchmark tasks, each with its own configuration. The author is outpost-bio, and the dataset was last updated on May 12, 2026.
Use Cases
- Benchmarking microbiome machine learning models using the dataset's structured task configurations.
- Training models for microbial community analysis on processed 16S rRNA data.
- Training models for microbial community analysis on processed shotgun metagenomic data.
- Evaluating model performance across different microbiome data splits (train, validation, test).
- Developing foundation models for microbiome research, in line with the dataset's stated purpose.
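As one way to read the evaluation use case above, the sketch below scores a trivial majority-class baseline on each split. The split names follow this card (train, validation, test). The labels and the `splits` dictionary are hypothetical toy data, not Compass contents, and any real model would take the place of the baseline.

```python
from collections import Counter


def majority_label(labels):
    """Return the most frequent label in a list of labels."""
    return Counter(labels).most_common(1)[0][0]


def accuracy(predictions, labels):
    """Fraction of predictions that match the reference labels."""
    if not labels:
        raise ValueError("cannot score an empty split")
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


# Hypothetical toy labels keyed by split name (not taken from Compass).
splits = {
    "train": ["case", "case", "control", "case"],
    "validation": ["case", "control"],
    "test": ["control", "control", "case", "case"],
}

# The baseline is chosen from the training split only, then scored everywhere.
baseline = majority_label(splits["train"])
scores = {
    name: accuracy([baseline] * len(labels), labels)
    for name, labels in splits.items()
}
print(scores)
```

Choosing the baseline from the training split alone keeps the validation and test scores free of information from those splits, which is the point of having separate splits in the first place.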
Strengths
- Dataset is structured for machine learning with dedicated train, validation, and test splits.
- Dataset was last updated on May 12, 2026, which suggests recent maintenance.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is not reported, which makes it harder to assess suitability before download.
- Description metadata is limited; actual data quality requires manual inspection after download.
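Because the card reports neither column documentation nor a row count, a first pass after download could simply count rows and guess a coarse type for each column. The following is a minimal sketch using only the Python standard library. It assumes a table has been exported as tab-delimited text. The helper names (`guess_type`, `summarize_table`), the column names, and the values are made up for illustration and are not taken from Compass.

```python
import csv
import io


def guess_type(values):
    """Guess a coarse type ("int", "float", "str", "empty") from string values."""
    non_empty = [v for v in values if v != ""]
    if not non_empty:
        return "empty"
    for name, cast in (("int", int), ("float", float)):
        try:
            for v in non_empty:
                cast(v)
            return name
        except ValueError:
            continue
    return "str"


def summarize_table(handle, delimiter="\t"):
    """Return (row_count, {column: guessed_type}) for a delimited text table."""
    reader = csv.DictReader(handle, delimiter=delimiter)
    rows = list(reader)
    columns = reader.fieldnames or []
    # Missing trailing fields come back as None, so treat them as empty strings.
    types = {c: guess_type([r[c] or "" for r in rows]) for c in columns}
    return len(rows), types


# Hypothetical toy table; real Compass files may use other names and formats.
toy = io.StringIO(
    "sample_id\ttaxon_a\ttaxon_b\tlabel\n"
    "s1\t12\t0.5\tcase\n"
    "s2\t0\t1.25\tcontrol\n"
    "s3\t7\t\tcase\n"
)
row_count, column_types = summarize_table(toy)
print(row_count, column_types)
```

For a real file, the `io.StringIO` object would be replaced by an open file handle, for example `open(path, newline="")`, with `delimiter` adjusted to match the file.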
Provenance
- Source: outpost-bio
- Collection Method: Processed from 16S and shotgun-derived microbiome data.
- Freshness: Last updated 2026-05-12 12:35:45.