Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
9.6 MB of prompts and correctness scores for various large language model responses, compiled by Mary Cummings. The dataset is hosted on figshare and was last updated on 2026-05-26. It is released under a CC-BY-4.0 license.
Data is in PDF format, which may require conversion or specialized parsing tools for analysis.