Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
GPT-4 annotated the clinical severity of over 17,500 phenotypic abnormalities in the Human Phenotype Ontology across nine clinical characteristics. The annotations were benchmarked against ground-truth labels with a mean true positive recall rate of 97%. This dataset, created by Kitty B. Murphy and last updated in May 2026, provides quantitative severity metrics for prioritizing therapeutic targets in rare diseases.
The dataset is very small (1.8 KB), indicating it likely contains aggregated results or summary scores rather than the full set of raw LLM annotations.