Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
75% of 169 expected patient care actions were correctly recommended by a retrieval-augmented generation (RAG) large language model (LLM) grounded in a single EMS agency's protocols. The exploratory evaluation, authored by Colin G Wang and uploaded on 2026-05 07, tested the model's accuracy across six adult and pediatric prehospital scenarios. The study identified 42 missed actions, including 9 categorized as 'major misses'.
Files are in PDF and DOCX formats, not a structured data file. The dataset contains the study's documentation and results, not the raw clinical data or model outputs.