A 2026 study by Yaoyan Lu analyzes a public dataset of 1,879 individuals to improve type 2 diabetes risk prediction. The study's hybrid model combines structured clinical variables such as BMI and HbA1c with unstructured medical text processed by a BERT-based NLP pipeline. The model's performance was validated on a post-2020 cohort of 939 individuals.
The primary file is a 447.3 KB PDF describing the study and methodology; the actual dataset files may not be included and would need to be located separately.
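As a rough illustration of what combining structured variables with text features can look like, here is a minimal early-fusion sketch in Python. It is not the study's method: the data are synthetic, the random vectors are only a stand-in for the pooled output of a BERT-based encoder, the plain logistic regression is an assumed classifier, and the function names (`fuse_features`, `fit_logistic`, `log_loss`) are hypothetical.

```python
import numpy as np


def fuse_features(structured, text_embeddings):
    """Early fusion: z-score the structured columns, then concatenate
    them with the text embedding vectors."""
    mean = structured.mean(axis=0)
    std = structured.std(axis=0)
    std[std == 0] = 1.0  # guard against constant columns
    scaled = (structured - mean) / std
    return np.hstack([scaled, text_embeddings])


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def log_loss(X, y, w, b):
    """Mean binary cross-entropy of a logistic model on (X, y)."""
    p = np.clip(sigmoid(X @ w + b), 1e-12, 1 - 1e-12)
    return float(-np.mean(y * np.log(p) + (1 - y) * np.log(1 - p)))


def fit_logistic(X, y, lr=0.1, steps=500):
    """Logistic regression fitted with plain batch gradient descent."""
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(steps):
        grad = sigmoid(X @ w + b) - y
        w -= lr * (X.T @ grad) / len(y)
        b -= lr * grad.mean()
    return w, b


# Synthetic data only: none of these values come from the dataset.
rng = np.random.default_rng(0)
n = 200
structured = np.column_stack([
    rng.normal(28.0, 5.0, n),  # assumed BMI-like column
    rng.normal(5.8, 0.9, n),   # assumed HbA1c-like column
])
# Stand-in for per-patient text embeddings (assumed 8 dimensions here).
text_embeddings = rng.normal(0.0, 1.0, (n, 8))
# Synthetic label that depends on both feature groups.
signal = structured[:, 1] + 0.5 * text_embeddings[:, 0]
y = (signal + rng.normal(0.0, 0.5, n) > 6.0).astype(float)

X = fuse_features(structured, text_embeddings)
w, b = fit_logistic(X, y)
risk = sigmoid(X @ w + b)  # predicted risk score per individual
```

In a real pipeline the placeholder embeddings would be replaced by vectors produced from the medical text, and the model would be evaluated on a held-out cohort rather than on its training data.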