Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
1,789,998 unique medical samples containing 3.78 billion tokens of chain-of-thought reasoning, released by OpenMed in February 2026. The dataset aggregates outputs from seven state-of-the-art AI models using fair distribution deduplication to ensure sample variety. It specifically focuses on clinical logic, with 1.56 billion tokens dedicated exclusively to reasoning steps.
Distributed under the Apache 2.0 license; users should be aware that the content is model-generated and requires validation for clinical accuracy before use in sensitive environments.