MentalBench is a benchmark dataset, created by hysong, for evaluating the psychiatric diagnostic capabilities of large language models (LLMs). It provides a framework grounded in real-world psychiatric knowledge for testing LLM reliability in a sensitive healthcare domain. The dataset was last updated on the platform in April 2026.
The full description and data details are hosted externally; see the dataset's Hugging Face page for complete information.