A 2026 benchmark evaluates the foundational interdisciplinary competencies of large language models in Traditional Chinese Medicine Informatics (TCMI). Zhaohang Teng constructed the TCMI-F-6D benchmark from six core disciplines in the MMLU dataset and used it to assess 20 large language models. The study provides a quantifiable framework for evaluating models in TCMI-related scenarios.
The primary data is embedded in a DOCX file, so users must extract the tables or metrics from the document themselves. The dataset is licensed under CC-BY-4.0.
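Because the data sits inside a DOCX file, one possible way to pull the tables out is sketched below. It relies only on the Python standard library and on the fact that a DOCX file is a ZIP archive whose main text lives in `word/document.xml`. The function name `extract_tables` is hypothetical, and the layout of this dataset's tables (number of tables, headers, columns) is not known here, so treat the result as raw rows to inspect rather than a ready-made schema.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by the main document part of a DOCX file.
W = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def extract_tables(docx_source):
    """Return each table as a list of rows, each row a list of cell strings.

    docx_source may be a file path or a binary file-like object.
    Hypothetical helper: a sketch, not part of the dataset's tooling.
    """
    with zipfile.ZipFile(docx_source) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))

    tables = []
    for tbl in root.iter(W + "tbl"):
        rows = []
        for tr in tbl.findall(W + "tr"):
            cells = []
            for tc in tr.findall(W + "tc"):
                # Word may split one cell's text across several runs; join them.
                text = "".join(t.text or "" for t in tc.iter(W + "t"))
                cells.append(text.strip())
            rows.append(cells)
        tables.append(rows)
    return tables
```

Calling `extract_tables("path/to/file.docx")` (the path is a placeholder) returns a list of tables that can then be written to CSV or loaded into a data frame. This sketch merges the text of any nested table into its parent cell and skips rows wrapped in content controls, so the output is worth checking against the document by eye.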