CuriaBench is a collection of evaluation datasets for the Curia foundation model, as described in the associated research paper. The datasets were created by the organization 'raidium' and the benchmark repository was last updated on March 31, III. The data is intended to assess the performance of multimodal AI models in radiology.
Use Cases
- Benchmarking multimodal foundation models based on radiology data mentioned in the description
- Evaluating model performance on medical imaging tasks referenced in the associated paper
- Comparing AI model outputs against established radiology evaluation criteria
Strengths
- Directly linked to a peer-reviewed research paper (arXiv:2509.06830)
- Associated with a specific, named foundation model (Curia-B) available on Hugging Face
Limitations
- Column-level documentation is absent; field semantics must be inferred after download
- Row count is unknown, which may limit suitability assessment
- Description metadata is limited; actual data quality requires manual inspection after download
Provenance
- Source
- raidium
- Collection Method
- Created for evaluating the Curia foundation model, as per the associated paper.
- Time Range
- null
- Freshness
- Last updated 2026-03-31 14:30:10; freshness should be verified
- Geography
- null