Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A research dataset of 1,879 individuals used to investigate integrating natural language processing with traditional clinical data for type 2 diabetes risk prediction. The dataset includes structured variables like BMI and HbA1c alongside unstructured textual entries such as symptom descriptions and lifestyle notes. It was created by Yaoyan Lu and last updated on 2026-05-28.
The primary file format is DOCX, which likely contains the research manuscript; the actual underlying dataset files may need to be located separately.