BiomniBench-DA-sample is a small representative subset of the full BiomniBench-DA benchmark created by phylobio. It contains data from three tasks, each from a different disease area, intended for reviewer inspection of dataset quality and structure. The full dataset comprises 50 released tasks with an additional 50 held-out tasks for evaluation.
Use Cases
- Inspect dataset structure and quality based on the representative sample description
- Evaluate task contents for different disease areas based on the description
- Assess suitability for contamination-resistant evaluation based on the held-out set mentioned
Strengths
- Contains a sample from three distinct disease areas, providing cross-domain representation
- Part of a larger benchmark with 50 released tasks and 50 held-out tasks for robust evaluation
Limitations
- Description metadata is limited; actual data quality requires manual inspection after download
- Column-level documentation is absent; field semantics must be inferred after download
- Row count is unknown, which may limit suitability assessment
Provenance
- Source
- phylobio
- Collection Method
- A sample selected from the 50-task BiomniBench-DA release.
- Freshness
- Last updated 2026-05-07 09:33:32