DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Psychiatric Diagnostic Benchmark For Large Language Models | DataSalon

Home Medical & ClinicalPsychiatric Diagnostic Benchmark For Large Language Models

Medical & Clinical

Psychiatric Diagnostic Benchmark For Large Language Models

Name: Psychiatric Diagnostic Benchmark For Large Language Models
Creator: hysong
Published: 2026-04-06T06:12:43
Keywords: Benchmark, Llm Evaluation, Text, Diagnostic Benchmark, Psychiatry

by hysong·Updated 2mo ago

Available on 1 platform

Description

MentalBench is a benchmark dataset designed to evaluate the psychiatric diagnostic capabilities of large language models, created by author hysong. It provides a framework grounded in real-world psychiatric knowledge to test LLM reliability in a sensitive healthcare domain. The dataset was last updated on the platform in April 2026.

Use Cases

Benchmarking LLM performance on psychiatric diagnostic questions using the benchmark's evaluation framework.
Analyzing model outputs for specific psychiatric conditions referenced in the benchmark's test cases.
Evaluating potential biases in LLM-generated diagnostic suggestions based on the provided real-world knowledge grounding.

Strengths

Evaluation framework is grounded in real-world psychiatric knowledge.
Dataset was updated on the platform in April 2026.

Limitations

Specific sample size, column details, and data volume are unknown.
Geographic and demographic coverage of the underlying psychiatric knowledge is unspecified.

Provenance

Source: hysong on Hugging Face.
Collection Method: Constructed as an evaluation benchmark, likely from psychiatric knowledge sources.
Freshness: Last updated on the platform in April 2026.

The full description and data details are hosted externally; users must visit the provided Hugging Face page for complete information.

Text Benchmark Llm Evaluation Diagnostic Benchmark Psychiatry

Related Datasets

Quality Score

D36

Description

Source

Reputation

Quality Score

D36

Description

Source

Reputation

Access

Community

3 likes

0 views

Dataset Info

Author: hysong
Created: Apr 6, 2026
Updated: Apr 6, 2026
Last synced: May 26, 2026

Access

Community

3 likes

0 views

Dataset Info

Author: hysong
Created: Apr 6, 2026
Updated: Apr 6, 2026
Last synced: May 26, 2026

Psychiatric Diagnostic Benchmark For Large Language Models

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info