Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A collection of standardized tasks for assessing mechanistic reasoning in AI agents, created by vida-nyu. The dataset provides experimental context, molecular signatures, and prompts to test an agent's ability to reconstruct explanations from peer-reviewed biological studies. It was last updated on February 27, 2026.
License is unknown; terms of use must be verified before application.