Structured telehealth data from 943 diabetes patients was augmented with physical activity information extracted from their free-text notes over a 12-year period. Fabian Wiesmüller published this research in 2026, benchmarking two local methods, a rule-based approach and a Mistral LLM, against GPT-4.1. The dataset includes 100 synthetically generated notes used for benchmarking the extraction algorithms.
The primary file is a 192.8 KB PDF, which likely contains the research paper and may not include the raw dataset in a directly machine-readable format.
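To make the rule-based extraction mentioned above more concrete, here is a minimal Python sketch of how physical activity might be pulled from a free-text note. It is not the method used in the study: the keyword list, the duration pattern, the function name `extract_activity`, and the English example notes are all hypothetical and chosen only for illustration.

```python
import re

# Hypothetical keyword list; the rules actually used in the study are not given here.
ACTIVITY_KEYWORDS = ("walking", "cycling", "swimming", "running", "hiking")

# Matches durations such as "30 min", "45 minutes", "1 h", "2 hours".
DURATION_RE = re.compile(r"(\d+)\s*(min|minutes?|h|hours?)\b", re.IGNORECASE)


def extract_activity(note):
    """Return the first known activity and its duration in minutes, or None."""
    text = note.lower()
    # Pick the first keyword (in list order) that occurs anywhere in the note.
    activity = next((k for k in ACTIVITY_KEYWORDS if k in text), None)
    if activity is None:
        return None
    minutes = None
    match = DURATION_RE.search(text)
    if match:
        value = int(match.group(1))
        unit = match.group(2).lower()
        # Convert hours to minutes; minute units are kept as-is.
        minutes = value * 60 if unit.startswith("h") else value
    return {"activity": activity, "minutes": minutes}


if __name__ == "__main__":
    # Invented example notes, not taken from the dataset.
    for note in ("Went cycling for 45 minutes today", "Blood sugar stable"):
        print(note, "->", extract_activity(note))
```

A rule set like this is easy to run locally and to audit, but it misses paraphrases and misspellings, which is presumably one reason to compare it against LLM-based extraction.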