Available on 1 platform
This benchmark comprises 25,949 normalized rows for evaluating end-to-end medical question-answering systems. It was originally built to evaluate MAMAI, a retrieval-augmented generation (RAG) chatbot for nurses and midwives in Zanzibar. The dataset, authored by nmrenyi, includes multiple-choice and open-ended QA tracks.
The license is unknown; verify the terms of use before using the dataset.