Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A collection of clinical vignettes constructed from participants' lived experience for evaluating large language models in psychosocial risk assessment. The dataset, authored by Laura M. Vowels and shared on figshare, contains examples used to benchmark GPT-4 and Claude 3 Sonnet on detecting suicide risk, intimate partner violence, and substance misuse. It was last updated on April 27, 2026.
License is CC-BY-4.0, requiring attribution.